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EPA leadership 
must be held 
to account 


The agency continues to aggressively 
undermine environmental regulations 
and demoralize its staff. 


hy are those at the helm of the US 

Environmental Protection Agency (EPA) 

continuing to roll back key environmen- 

tal and public-health regulations? Why 

are they making it harder for themselves 
— and future administrations — to craft new regulations 
based on all of the available research? And why, in their 
actions, do they continue to challenge the place of rigorous 
evidence in policymaking? 

We need to keep asking these questions, because, 
although the world’s attention is rightly focused on the 
coronavirus pandemic, in the past six weeks alone, the 
agency’s leaders have accelerated moves to weaken 
research-based regulations. 

They are walking a road that the EPA’s founders — 
who launched the agency 50 years ago — could not have 
imagined the organization would take. In spite of reasoned 
opposition from researchers and many businesses, too, the 
EPA’s leadership — with the tacit support of many Republi- 
cans inthe US Congress — is undermining the very reasons 
for its own existence. This is a travesty. 

The latest target is the well-established body of evi- 
dence that demonstrates how tiny airborne particles 
embed themselves deep in the lungs and increase the risk 
of premature death from respiratory and cardiovascular 
diseases. 

In a report published last September, the EPA’s staff 
reviewed recent literature, including research indicating 
that even low levels of particulate pollution contribute to 
premature deaths in the United States. The report con- 
cluded that such evidence justified reducing the maximum 
permitted concentrations of fine particulate matter from 
12 micrograms per cubic metre of air to between 8 and 
10 micrograms. But on14 April, the agency’s administrator, 
Andrew Wheeler, said that there is no need for change. No 
reasoning was provided for this decision except that: “The 
U.S. has made incredible strides in reducing particulate 
matter concentrations across the nation.” 

Two days later — ina different sphere of regulation — the 
EPA confirmed a change to the valuation methodology for 
regulations curbing the emissions of mercury and other 
pollutants from fossil-fuel-fired power plants — a change 
that could have far-reaching consequences for people and 
the environment. 

Enforcing existing mercury standards costs the US 
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utilities industry up to US$10 billion annually. In 2015, the 
administration of Barack Obama calculated that, in addi- 
tion to preventing up to 11,000 premature deaths in the 
United States, these standards also save the country inthe 
range of $37 billion to $90 billion a year — money that would 
otherwise be spent on, for example, extra health-care costs. 
But the EPA’s leadership brushed aside the research and 
data that support these findings. The EPA is currently con- 
sidering new guidance that would allow it to ignore such 
comprehensive assessments of comparisons of costs ver- 
sus benefits — leading to potentially weaker environmental 
and public-health regulations in other areas, too. 

On top of all this, in March the administration of US 
President Donald Trump finalized a revised set of targets 
governing greenhouse-gas emissions from new vehicles. It 
slashed the original ambition of 5% in annual reductions to 
1.5%. (By contrast, the European Union’s target is for a27% 
reduction, compared with its 2015 target, in carbon emis- 
sions per kilometre from cars manufactured after 2020.) 


Controlling the science 


Each of these revisions builds on the Trump administra- 
tion’s overarching narrative since 2017: to control the 
research that is used as evidence for the EPA’s decisions. 
This began witha proposed rule that is misleadingly called 
Strengthening Transparency in Regulatory Science. In 
reality, it is a declaration that the EPA intends to limit its 
use of studies for which underlying data and models are 
not publicly accessible. This move has provoked wide- 
spread condemnation from researchers and searching 
questions from the EPA’s ownscience advisory board. The 
rule, if implemented as written, could allow the agency 
to disregard landmark public-health studies for which 
underlying data — such as confidential health records — 
are not accessible for good reasons. 

In all, more than 80 EPA rules and regulations are being 
revised or removed, and researchers have been campaign- 
ing and protesting as never before. They must continue 
to do so, working with allies in industry and across other 
sectors. It can be difficult and frustrating when the doors 
to reasoned engagement are closing. But it is, neverthe- 
less, crucial to push back, so that the agency can be rebuilt 
when — not if — the time comes. 

When the EPA disbanded one of its air-quality science 
advisory panels in 2018, the panel’s members recon- 
vened independently to continue their work, and issued 
a statement advising the agency to strengthen the partic- 
ulate-matter standard. And when the agency refused to 
hold an evidence session on its proposed ‘transparency’ 
rule, members of the Union of Concerned Scientists, a sci- 
ence advocacy group based in Cambridge, Massachusetts, 
organized their own public hearing. 

Some familiar names in industry also no longer wish 
to be associated with regulatory decisions they know 
will be harmful — and are choosing to stand with the 
science. Automobile manufacturers Ford, BMW, Honda 
and Volkswagen have said that they wish to comply with 
the stricter vehicle-emissions standards developed by the 
state of California. Some power corporations that have 
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already invested in meeting existing regulatory standards 
are also unwilling to roll back. 

The EPA is becoming a shell of its former self. Its leaders 
have chosen to abdicate leadership, disregard evidence 
and expose the country’s environment and health to risk 
of further degradation. Many staff have understandably 
chosen to leave. Those who remain should know that they 
have supporters around the world, rooting for them and 
for the EPA — and hoping that, one day, it will once again 
become an evidence-based body that lives up to its name 
and secures its founders’ legacy. 


Share lessons on 
lifting lockdowns 


Countries are starting to lift restrictions 
on movement, but there is little consensus 
on how this should be done. 


hen can lockdowns be lifted, and how 

should this happen? These urgent 

questions are being asked in each of 

the 180 or so countries and territories 

that enacted full or partial restrictions 
on movement in response to the new coronavirus. But as 
countries start tentatively to reopen schools, businesses 
and public places, it is also becoming clear that there is 
little consensus on how this should be done, because 
efforts to coordinate actions globally are running into 
the ground. 

The need to get countries back to work is, of course, 
urgent. According to the International Labour Organiza- 
tion (ILO), an agency of the United Nations, stay-at-home 
orders have imperilled the lives and livelihoods of at least 
1.6 billion people. The majority are workers who have no 
income or health protections and whose work cannot be 
done from home. No work means no food, and no money 
to pay rent. Governments have stepped in with various 
forms of temporary financial assistance, but they have 
also taken on large debts to provide this support. 

Researchers are warning that when this financial 
support ends, the ill health induced by poverty could 
worsen. The World Food Programme is warning that 
265 million people will experience acute hunger by the 
end of 2020 — twice the number facing food crises before 
COVID-19 (see go.nature.com/3c8aeb)j). 

All of this is adding to pressure to reopen economies 
and societies. Yet the pandemic has not yet run its course, 
and infections and deaths continue to rise. If restrictions 
are lifted prematurely, the impacts of the virus risk being 
prolonged. 

The World Health Organization published guidance last 
month for countries considering easing some restrictions 
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— advice that is intended to minimize a risk of aresurgence 
of the virus (see go.nature.com/2a4jdmt). There can be 
no one-size-fits-all solution to easing restrictions. How- 
ever, guided by systematic reviews of the peer-reviewed 
literature, and by its decades of experience in dealing 
with infectious diseases, the agency recommends that, in 
any scenario, disease transmission must be controlled — 
otherwise, without a vaccine, there’s a risk that the virus 
will return. In many countries, transmission is uncon- 
trolled and must be reduced so that cases arise only 
sporadically, or in clusters in identifiable regions. This, 
in turn, requires public-health systems to detect, test, 
isolate and treat cases of infection, and to trace people 
who had been in contact with those infected — something 
that would require large teams of contact-tracers. 

The WHO’s guidance, in addition, says that countries 
need to minimize transmission in potential hotspots, such 
as care homes or large gatherings; and that places where 
people gather — for example, schools and workplaces — 
should be provided with equipment to ensure that phys- 
ical distancing can be implemented. The agency is also 
clear — asis the ILO — that governments must continue to 
provide financial and other forms of support to protect 
the most vulnerable groups. 

By 1 May, according to researchers at the University of 
Oxford’s Blavatnik School of Government, UK, no country 
had met even four of the six WHO criteria that a150-mem- 
ber team convened by the university has been measuring. 
And only 20 countries and territories, including Trinidad 
and Tobago, Croatia, Hong Kong and South Korea have 
come close (see go.nature.com/2zbslef). 

This is largely because most countries (more than 150) 
have not yet met the first requirement — for COVID-19 to 
be reduced to sporadic cases and identifiable clusters. 
Moreover, at least 50 countries still lack adequate pol- 
icies for testing, contact-tracing and isolating infected 
individuals. Such measures, along with physical distanc- 
ing, have also been essential in helping to keep China’s 
cases under control since travel restrictions were lifted 
in February — as research published this week in Nature 
confirms (S. Lai et al. Nature https://doi.org/10.1038/ 
$41586-020-2293-x; 2020). 

On 4 May, the international community — led by 
European Union member states and global philanthropic 
funders — raised US$8 billion for research and develop- 
ment on COVID-19 and pledged to work with the WHO to 
accelerate efforts to develop vaccines and drugs. Such 
collaboration is essential and is to be welcomed, because 
it will ensure that communities in all countries can even- 
tually be protected from the disease. But paying attention 
to the WHO’s guidelines, and sharing best practice on 
easing lockdowns, is essential, too. 

Countries are having to make difficult decisions as 
lockdowns are eased. And they are doing so while their 
researchers are still gathering and processing data. These 
data should be published and shared. Circumstances in 
individual countries will necessarily be different, but the 
world will benefit from the mutual learning that will come 
fromsuch a global effort. 
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A personal take on science and society 


World view 


By Ashley Shew 


Let COVID-19 expand 


awareness of disability tech 


The pandemic’s disruption shows 
how much academia could learn 
from the disability community. 


isabled people including myself have long cam- 
paigned for accommodations to help us live 
our lives. The COVID-19 pandemic shows that 
these are not as impractical as we have always 
beentold. Supermarkets, restaurants and phar- 
macies (even outside cities) can deliver; remote working, 
medicine and education are possible for many; and social 
lives can be rewarding without requiring us to leave home. 

All around me, I see academic colleagues adopting 
disability-led hacks and long-sought accommodations. 
I wish everyone had thought about these workarounds — 
and about disabled people at all — earlier. When lockdowns 
end, we must not forget these lessons, not least because 
the pandemic will disable people, and the impacts will be 
felt most by the most vulnerable parts of society. 

Academiais paying for its ableism. At many universities, 
in-person research with human participants and in labora- 
tories has been curtailed. If these projects had considered 
disabilities, they might be better off: disabled academics 
already plan in short increments, with built-in flexibility. 

In 2014, I returned to my job as an assistant professor, 
newly multiply-disabled — a hard-of-hearing amputee 
battered by chemotherapy and more. | felt out of place. 
I could no longer access many spaces, including most of 
my colleagues’ offices, and I sought the camaraderie of 
other disabled faculty members, staff and students. My 
disabled comrades and I recognize the diversity of dis- 
abilities: congenital and acquired; ranging from cognitive 
to sensory, mobility and more; apparent and not. Many of 
us pursue research that emphasizes how disabled people 
are the best experts on the technologies and structures 
that meet our needs. 

One of my projects examines accounts of disabled 
people’s lived experiences with technologies, and howthey 
differ from those of the scientists and engineers behind the 
tech. I planned my work intending to recruit disabled stu- 
dents among my researchers. Most of the studies can be 
done remotely, even from bed, and ona funky, asynchronous 
timetable as needed. Last year, when I had lung surgery, 
my group shifted gears without worry. And because of its 
disability-led design, my team’s project is pandemic-proof. 

Another project, to gauge the experiences of students in 
civil-engineering classes, was designed to include partici- 
pants with a range of disabilities. So we obtained approval 
for flexibility in communication format: we conduct our 
interviews by text, e-mail, Zoom and other means. Because 
we planned for disabled people to lead and participate in 
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the research, we're well prepared for the current situation 
— or for any other. 

Many disabled people are also adept at managing our 
energy, and forgiving ourselves for not always meeting 
conventional metrics of ‘productivity’. My non-disabled 
colleagues are now struggling to adjust, but my team 
appreciates that ‘clocks should bend to our bodies’, not the 
other way around. Some disabled people call this concept 
Crip Time, reclaiming a derogatory term in pride (much 
like ‘queer’ for many LGBT+ people). 

The disability community creates and lobbies for technol- 
ogies and infrastructure that work better for all. Deaf and 
disabled people fought hard for things such as captioning 
ontelevision, which has since become ubiquitous in sports 
bars and airports and can now be appreciated by people 
streaming media while those they live with rest or work. 

The bitter irony is that, at the moment when non-disa- 
bled — or not-yet-disabled — people are beginning to nor- 
malize these disability hacks and hard-won infrastructure, 
society’s disregard for disabled people is clear. We are 
dying of COVID-19 in greater numbers than are non-disa- 
bled people, in rehabilitation facilities, state institutions 
(including prisons), group homes and care homes. 

Many accommodations demanded under COVID-19 were 
implemented within weeks, including the ability to work 
from home, to have flexible schedules, to get what we need 
without excessive and demeaning documentation, to share 
and celebrate creative adaptation, to work with the knowl- 
edge that all schedules can change. These are all things 
that disabled and chronically ill people have wanted for a 
very long time. I hope that when we’ve flattened the curve 
and saved as many people as possible, we don’t return to 
a world in which disabled people are ignored (especially 
when COVID-19 will probably produce more of us). 

So start making changes that should have been standard 
allalong. Plan creatively and accessibly to allow more work 
offsite, and to include people whose clocks aren't steady. 
Welcome suggestions from disabled colleagues and stu- 
dents about how to make the environment work best for 
their neurotype and schedule. Be ready to take criticism: 
too often, work is set up as dictated by convention, rather 
than by calling on relevant experiences and possibility. 

Make your teaching and scholarly materials multi-modal: 
produce formats that work for people with different phys- 
ical conditions and ways of reading and communicating, 
sharing and contributing. Think about multiple ways to 
allow participation in your funding, reviewing, research 
and engagement. Let’s see an end to patronizing objecti- 
fication and assumptions about what we want and need: 
include disabled people in boards, teams and studies, and 
learn from us. We have had to become masters of invention. 
The pandemic has made the value of that clearer than ever. 


Nature | Vol 581 | 7 May 2020 | 9 


© 2020 Springer Nature Limited. All rights reserved. 


WATER: EDUCATION IMAGES/UNIVERSAL IMAGES GROUP VIA GETTY; REMDESIVIR: ZHANG YAZI/CHINA NEWS SERVICE VIA GETTY; GRAPHIC DATA SOURCE: RESIN/UNIV. SOUTHAMPTON 


The world this week 


Newsin brief 


SCIENTISTS WARN: 
DON'T DRINK THE 
STAGNANT WATER 


As some regions prepare to 

lift the lockdowns imposed in 
response to COVID-19, scientists 
who study water are worried 
about the potential fora 
secondary health crisis waiting 
inthe dormant plumbing of 
offices, gyms, restaurants and 
schools, which could house 
increased levels of pathogens 
and heavy metals. 

“There really isn’t alot of 
scientific awareness of these 
larger buildings,” says Andrew 
Whelton, an environmental 
engineer at Purdue University 
in West Lafayette, Indiana. 
“Because of that, there’s no 
guidance.” 

When Purdue’s campus shut 
down in March, Whelton’s 
laboratory began sampling 
water in several large buildings 
across the university. 

Water quality ina complex 
building can vary by season, by 
time of day or even from room 
to room. So one:size-fits-all 
guidelines for flushing systems 
are unlikely to help, says Purdue 
environmental engineer Caitlin 
Proctor. The most common 
advice is to run all taps at full 
flow for aset amount of time 
— usually 5 or 10 minutes — to 
flush the stagnant water from 
the pipes. 

But one building that the 
Purdue team studied required 
more than a full day to flush. 


DRUG REMDESIVIR AUTHORIZED 
TO TREAT CORONAVIRUS 


One of the world’s best hopes 
for treating COVID-19 —a 
compound called remdesivir 

— has been authorized asa 
therapy against the disease. On 
1 May, the US Food and Drug 
Administration (FDA) granted an 
‘emergency-use authorization’ 
for clinicians to use the 

drug, which is administered 
intravenously, in hospitals for 
people with severe COVID-19. 

The authorization came after 
results from the largest and 
most rigorous clinical trial yet of 
the compound showed that the 
antiviral could shorten the time 
to recovery from coronavirus 
infection. Several smaller trials 
had reported mixed results in the 
past few weeks, but some had no 
placebo-controlled arm, making 
the results hard to interpret. 

On 29 April, Anthony Fauci, 
director of the US National 
Institute of Allergy and Infectious 
Diseases (NIAID), said that the 
latest clinical trial, involving 
more than 1,000 people, showed 
that those taking remdesivir 
recovered in 11 days on average, 
compared with 15 days for those 
onaplacebo. “Although a 31% 
improvement doesn’t seem like 
aknockout 100%, it isa very 
important proof of concept,” 
Fauci said. “What it has proven is 
that a drug can block this virus.” 
He added that remdesivir would 


become a standard treatment for 
COVID-19 in the United States. 

Remdesivir, which was 
previously tested against Ebola, 
interferes with an enzyme that 
some viruses use to replicate; 
these include SARS-CoV-2, which 
is responsible for the current 
pandemic. 

The latest trial results , which 
have not yet been published in 
a peer-reviewed journal, also 
showed that there were fewer 
deaths among participants who 
received the drug, Fauci said, but 
that trend was not statistically 
significant. However, the 
shortened recovery time was 
significant and beneficial 
enough for investigators to stop 
the trial early to ensure that 
participants receiving a placebo 
could access the drug. 

The FDA's authorization is not 
a final drug approval, and can 
be revoked when the conditions 
required for emergency 
use are no longer in effect. 

US distribution of the drug will 
be government-controlled. 

“It may not be the wonder 
drug that everyone’s looking 
for, but if you can stop some 
patients from becoming 
critically ill, that’s good enough,” 
says virologist Stephen Griffin at 
the University of Leeds, UK. The 
drug’s maker is Gilead Sciences 
in Foster City, California. 


HISTORICAL SPEND 
ON CORONAVIRUS 
RESEARCH WAS TINY 


Before the COVID-19 pandemic, 
funding for coronavirus research 
made up just 0.5% of global 
infectious-diseases spending 
by public and philanthropic 
research organizations. 

From 2000 to the start of 
this year, about US$550 million 
was spent on coronavirus- 
related research, reveals an 
analysis by researchers at the 
University of Southampton, UK. 
By comparison, Ebola-related 
research got $1.2 billion (1.1%). 

Spending has risen to 
$985 million since the current 
outbreak began. Some 
$275 million of COVID-19 
research funding is focused 
on vaccine development; 
$40 million is focused on 
therapeutics and $18 million 
on diagnostic tests. The figures 
cover spending by more than 
1,000 funders worldwide. 

The researchers note that the 
spending has generally been 
reactive — explaining spikes in 
2004 and 2015, after outbreaks 
of the coronaviruses that cause 
the diseases SARS and MERS, 
respectively. 


CORONAVIRUS CASH 


Public and philanthropic bodies have 
spent US$985 million on coronavirus 
research since 2000. But before 2020, 
this constituted only 0.5% of total 
spending on infectious-disease research. 


$569 m 


$416 m 


Other coronaviruses COVID-19 
2000-20 (2020) 
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The world this week 


News in focus 


WHOSE CORONAVIRUS STRATEGY 


WORKED BEST? SCIENTISTS HUNT 


MOST EFFECTIVE POLICIES 


Researchers sift through data to compare 
nations’ vastly different containment measures. 


By Elizabeth Gibney 


ong Kong seems to have given the 
world a lesson in how to curb COVID- 
19 effectively. With a population 
of 7.5 million, it has reported just 
4 deaths. Researchers studying its 
approach have found that swift surveillance, 
quarantine and social-distancing measures, 
such as school closures, helped to cut coro- 
navirus transmission — measured by the aver- 
age number of people each infected person 
infects, or R — to close to the critical level of 
1 by early February. But the paper, published 


in April, couldn’t tease apart the effects of the 
various measures and behavioural changes 
happening at the same time (B. Cowling et al. 
Lancet Public Health 5, e279-e288; 2020). 
Working out the effectiveness of the meas- 
ures implemented worldwide to limit the coro- 
navirus’s spread is now one of scientists’ most 
pressing questions. Researchers hope that, 
ultimately, they will be able to accurately pre- 
dict how adding and removing control meas- 
ures affects transmission rates and infection 
numbers. This information will be vital to gov- 
ernments as they design strategies to return 
life to normal, while keeping transmission low 
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to prevent second waves of infection. “This is 
not about the next epidemic. It’s about ‘what 
do we do now?” says Rosalind Eggo, a math- 
ematical modeller at the London School of 
Hygiene and Tropical Medicine (LSHTM). 
Researchers are already working on mod- 
els that use data from individual countries to 
understand the effect of various control meas- 
ures. Models based on real data should be more 
nuanced than those that, at the start of the out- 
break, predicted the effects of interventions 
mainly using assumptions. Combining data 
from around the world will allow scientists to 
compare countries’ responses. And this should 
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allow them to design models that can make 
more accurate predictions about new phases 
of the pandemic and across nations. 

But untangling cause and effect is extremely 
challenging, in part because circumstances 
differ in each country and because there is 
uncertainty over how well people adhere to 
control measures, cautions Eggo. “It’s really 
hard but it doesn’t mean we shouldn't try.” 


Pulling together 


Efforts to tackle these questions will soon get 
a boost from a database that brings together 
information onthe hundreds of different inter- 
ventions introduced worldwide. The platform, 
being prepared for the World Health Organi- 
zation (WHO) by ateam at the LSHTM, gathers 
data collected by ten groups already tracking 
interventions. These include researchers at the 
University of Oxford, UK, and at the Complex- 
ity Science Hub Vienna (CSH Vienna). 

The database will standardize the infor- 
mation collected by the different teams and 
should be more comprehensive than anything 
an individual group could generate, says Chris 
Grundy, a data scientist behind the LSHTM pro- 
ject. Agencies such as the WHO routinely track 
control measures used in an outbreak, but for 
COVID-19 the picture is complicated by the 
pandemic’s speed and scale, says Grundy. The 
data set will be open, and improved in future 
releases, he says. Speed is of the essence: “Days 
make a difference right now.” 

The trackers lay bare the vast range of poli- 
cies deployed in different nations. The Vienna 


team has captured details of around 170 inter- 
ventions in 52 countries, ranging from small 
measures such as floor stickers that mark a 
2-metre separation to major policies such as 
school closures. The team is also following 
some countries’ recent efforts to restart nor- 
mal daily life. Oxford’s project, the COVID-19 
Government Response Tracker, is monitoring 
13 interventions in more than 100 countries. It 
compiles 7 of the 13 into a single ‘stringency’ 
index that captures the overall severity of 
each country’s response and allows for com- 
parison between countries’ approaches (see 
‘Pandemic protections’). 

Already, scientists are analysing their datato 
explore differences in responses. The Vienna 


“Ifwe don’t know what 
works, it’s going to be 
really difficult to decide 
what to donext.” 


team is looking for patterns, using methods 
that include clustering countries by how 
early in their epidemics they began introduc- 
ing interventions and by the total number of 
restrictions introduced. In Europe, for exam- 
ple, algorithms group Sweden, the United 
Kingdom and the Netherlands together as 
countries that acted relatively slowly. 
Meanwhile, Germany and Austria stand out 
as nations that adopted aggressive and early 
control strategies, compared with Italy, France 


PANDEMIC PROTECTIONS 


Researchers have created a ‘stringency index’ that describes the overall severity of a country’s 
response to the coronavirus outbreak and allows responses to be compared. The index takes into 
account seven control measures, such as school closures and restrictions on people’s movements. 


Stringency Index: Low == ~ e@ High 


China 

Index was high in 
Hubei but lower 
nationwide 


Germany 
Regional school 
closures began in 
February 


South Korea 
Social distancing, 
and some 
businesses closed 


Italy 

First nationwide 
lockdown on 

11 March 


Total confirmed deaths (logarithmic scale) 


United Hong Kong 
Kingdom Intense 
Lockdown surveillance for 
on 23 March infections 


United States 
Measures vary; 
most states have 
stay-at-home orders 


Sweden 

No lockdown; 
restaurants and 
bars open 


0 50 


100 O 50 


100 O 50 


100 O 50 100 


Days since first confirmed death 


Confirmed deaths undercount true COVID-19 mortality. Stringency index developed by the Oxford COVID-19 
Government Response Tracker. Data downloaded on 21 April; countries vary in day of most recent data update. 
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and Spain, which implemented similar meas- 
ures, including lockdowns, but later in their 
epidemics. So far, Germany and Austria have, 
per capita, seen a fraction of the deaths from 
COVID-19 reported in these other countries. 
Early findings from the Oxford team also 
suggest that poorer nations tended to bring 
in stricter measures than did richer countries, 
relative to the severity of their outbreaks. 
For example, the Caribbean nation of Haiti 
enforced lockdown on confirming its first case, 
whereas the United States waited until more 
than two weeks after its first death to issue 
stay-at-home orders. That might be because 
lower-income countries with less-developed 
health-care systems act more cautiously, says 
Anna Petherick, a public-policy researcher at 
Oxford. It could also reflect the fact that the 
outbreak reached these nations later, giving 
them longer to learn from others, she says. 


Patterns and predictions 


Ultimately, researchers hope to use data 
fromthe LSHTM database to understand how 
effective these strategies were in limiting the 
outbreak. “We really need to evaluate those 
interventions in real time, so everybody can 
make real policies,” says Eggo, who was not 
involved in creating the database but plans 
to use it. “If we don’t know what works and we 
don’t know how much, it’s going to be really 
difficult to decide what to do next.” Eggo and 
her colleagues will use the data to test the 
accuracy of mathematical models — which use 
equations to describe the rate of transmission 
and mechanisms behind it — under varying 
intervention types and timing. 

Ideally, researchers will be able to forecast 
how adding and removing interventions would 
change the number of infections over time. 
Policymakers could use such predictions, 
together with data on intensive-care capac- 
ity, to make decisions — on whether to reopen 
schools, for example — says Nils Haug, a 
mathematical physicist at CSH Vienna and 
the Medical University of Vienna. 

Haug and his team of modellers are explor- 
ing which statistical approaches to use. Rather 
than directly determining the precise effect 
of each intervention, these methods can find 
ways to identify the measures that best predict 
infection rates. One approachinvolves using a 
machine-learning technique called a recurrent 
neural network to learn from patterns in the 
data and make predictions. Scientists can learn 
how important a given intervention is by look- 
ing at how predictions shift when they remove 
information about it from the network. 

Without a vaccine or effective treatment, 
stopping transmission remains the only 
defence against COVID-19. Knowing the 
effects of each control measure is crucial to 
figuring out which ones can be safely altered 
or removed, says Petherick. “I think that would 
bea huge contribution.” 


SOURCE: OXFORD CORONAVIRUS GOVERNMENT RESPONSE TRACKER (DATA); NATURE (CHARTS). 


DANE RHYS/BLOOMBERG VIA GETTY 


A coal-fired power plant in Colorado will not face stricter standards. 


FIVE WAYS TRUMP 


IS UNDERMINING 


ENVIRONMENTAL HEALTH 


While the world focuses on a pandemic, the United 
States is adopting controversial environment policies. 


By Jeff Tollefson 


he US Environmental Protection 
Agency (EPA) turns 50 this year, but 
scientists and environmentalists see 
little reason to celebrate. In the mid- 
dle of a global pandemic, the agency’s 
leadership is pressing forward with efforts to 
roll back environmental regulations and alter 
the way in whichscienceis used to craft policy. 

In the past month alone, the agency has 
dialled down regulations on automobile 
emissions and fuel efficiency; it has weak- 
ened rules on mercury and other pollutants 
emitted by power plants; and it has shied 
away from strengthening standards to reduce 
fine-particle air pollution. 

“This is an extremely aggressive agenda,” 
says Betsy Southerland, who spent more than 
three decades as an EPA Official before retiring 
in protest against the current administration’s 
policies in 2017. 


Two rollbacks on emissions 


At the end of March, President Donald Trump’s 
administration finalized a plan to scale back 
targets for automobile-emissions reductions 
from 5% per year to 1.5%, a change that the 
EPA acknowledges could result in an extra 


867 million tonnes of carbon dioxide being 
released into the atmosphere by vehicles sold 
over the next decade. 

In mid-April, the EPA issued another rule, 
targeting mercury-emissions standards for 
power plants first put in place under the pre- 
vious president, Barack Obama. Although 
the agency left the existing limits in place, it 
adjusted how the rules’ costs and benefits are 
calculated, weakening their economic justifi- 
cation. The original price tag reported for the 
2011 regulation took into account health bene- 
fits from a reduction in particulate matter that 
would accompany cuts to mercury emissions. 

Taking these out of the equation makes 
the rule seem more expensive, says David 
Spence, a political scientist and law scholar 
at the University of Texas at Austin. It also sets 
aprecedent that could undermine the mercury 
rule and others. 


Inaction on particulates 


Even more alarming, public-health experts 
say, was a decision on fine-particle pollu- 
tion that EPA administrator Andrew Wheeler 
announced in mid-April. In that case, the 
EPA went against the advice of its own staff 
and many academic scientists by leaving 
the current standards in place — in spite of 
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evidence that reducing such pollution could 
save thousands of lives each year (Q. Diet al. 
N. Engl. J. Med. 376, 2513-2522; 2017). 

Ina report issued last September, EPA staff 
cited epidemiological and other evidence 
that would support cutting the maximum 
allowed average level of fine particulate matter 
from 12 micrograms per cubic metre of air to 
between 8 and 10. 

The regulatory process that prevented 
that change was tipped towards the interests 
of polluters from the outset, with little to 
no independent scientific oversight, says 
Christopher Frey, an environmental engineer 
at North Carolina State University in Raleigh. 
Frey formerly chaired the EPA’s scientific 
advisory committee on clean air, and was a 
member of a review panel for the issue that 
was disbanded in October 2018. 

“Rather than focusing on protecting public 
health, EPA is on a misguided mission to 
protect the profits of regulated industries,” 
Frey says. “But it’s all based ona lot of miscon- 
ceptions and assumptions, rather than facts 
or evidence.” 


Two proposed rules 


Looking ahead, it could become even more 
difficult to bring health data and other evi- 
dence to the policymaking table if the EPA 
moves forward with a pair of proposals that 
would alter how science is used and evalu- 
ated at the agency. First is a “transparency” 
rule that could restrict the use of public-health 
studies — including much of the epidemiolog- 
ical research that the agency has used to set 
particulate-pollution standards in the past. 

A draft proposal states that if underlying 
data and models are not publicly available — 
often the case for private health-care data — 
the EPA could give them less weight or exclude 
them from consideration when setting stand- 
ards and conducting scientific assessments. 

“It would apply to pretty much all of EPA’s 
major work,” says Michael Halpern, deputy 
director of the Center for Science and Democ- 
racy at the Union of Concerned Scientists, an 
advocacy group in Cambridge, Massachusetts. 
Amendments to the proposal are open to 
public comment until 18 May. 

A second proposal, currently pending 
review at the White House, would change how 
the agency evaluates the costs and benefits 
of environmental and public-health regula- 
tions, much as it did in its re-evaluation of the 
mercury-emissions standards. 

Taken together, the cost-benefit guidance 
and the transparency rules could help the 
Trump administration to justify removing 
regulations, and could hamper regulatory 
efforts by future administrations. These and 
other EPA decisions will be challenged in court, 
but scientists and environmentalists say that 
provides little solace. “They are rolling back 
progress, and weare losing time,’ Halpern says. 
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News in focus 


REFUGEE CAMPS RACE 
TO AVERT CORONAVIRUS 


CATASTROPHE 


From Bangladesh to Greece, researchers and 
aid workers are taking steps to protect people. 


By Nidhi Subbaraman 


here are 70 million refugees, displaced 

people and asylum seekers around 

the world. Close living quarters, sig- 

nificant underlying health problems 

and limited access to sanitation and 
medical care mean that COVID-19 — which has 
brought some of the best health systems in the 
world to their knees — poses an outsize threat 
to these communities. 

Although there are some reports of refugees 
testing positive for the virus, as of mid-April 
there had been no known COVID-19 outbreaks 
in major refugee camps, according to advo- 
cates and responders contacted by Nature. But 
many aid groups fear that it is only a matter of 
time before the disease strikes. According to 
advocacy groups, host nations have been slow 
to enforce preventive measures. And experts 
fear that aid organizations will struggle to rally 
and respond. 


Models in Cox’s Bazar 


Nearly 600,000 Rohingya people now live in 
the Kutupalong-Balukhali Expansion Site in 
Cox’s Bazar, Bangladesh, having fled perse- 
cution in Myanmar. The refugee camp is one 
of the world’s biggest and most densely pop- 
ulated. And it is the first to be used ina model 
of COVID-19, says Paul Spiegel, director of 
the Center for Humanitarian Health at Johns 


An aid worker in Greece distributes masks. 
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Hopkins University in Baltimore, Maryland. 

Spiegel and his team projected outcomes 
at the camp using data, mainly from China, 
on age, case severity and case fatality rates 
for the outbreak (S. Truelove et al. Preprint at 
medRxiv http://doi.org/ggqtq4; 2020). The 
analysis has not been peer reviewed. 

The group modelled outcomes in low-, 
moderate- and high-transmission scenarios. 
The worst-case scenario exhausts the encamp- 
ment’s medical capacity in just 58 days. When 
on-site hospitals are overwhelmed with 
COVID-19, there could be spikes in deaths from 
other diseases, such as malaria. 

A spokesperson for the United Nations 
High Commissioner for Refugees says that the 
organization has used this model to guide its 
response, alongside partner organizations. 


Aplan to move in Greece 


Five Aegean islands off the coast of Greece 
host refugees and asylum seekers from the 
Middle East and south and central Asia before 
they can enter the country. Greece has facili- 
ties for about 6,000 people awaiting asylum 
decisions at these island entry points, but the 
camps have swelled to nearly 40,000 people. 
Many are living inrudimentary encampments 
among the olive groves, with limited access to 
running water and toilets. 

“They are living in tents, makeshift camps 
and carton boxes. It’s a highly overcrowded 
situation,” says Apostols Veizis, director of 
the medical operational support unit in Athens 
for aid organization Médecins Sans Frontiéres 
(MSF, also known as Doctors without Borders). 
Although other parts of the world are prioritiz- 
ing social distancing and lockdowns to slow 
the spread of the coronavirus, the options are 
limited ina refugee camp. “There’s not really an 
ability to self-isolate or quarantine within these 
communities,’ says Devon Cone, asenior advo- 
cate with Refugees International, a humani- 
tarian organization based in Washington DC. 

In response to the pandemic, MSF is prior- 
itizing provision of water and sanitation facil- 
ities to the camps. Veizis says there is a plan to 
protect about 2,400 high-risk residents — those 
over 60 and those with chronic conditions — 
by moving them to mainland hotels. And in 
April, the European Union pledged €350 mil- 
lion (US$377 million) to support refugees and 
asylum seekers, including those in Greece. 
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Q&A 


Mission 
totrace 


Former World Bank president Jim Yong 
Kim is spearheading an ambitious 
US$44-million project to control 
COVID-19 in the state of Massachusetts. 
The effort will involve a legion of contact 
tracers, who will phone people diagnosed 
with COVID-19 and their contacts, in 

the hope of isolating those infected and 
stopping the spread of coronavirus. 


What surprised you about the response to 
COVID-19 in the United States? 

After the first cases of community 
transmission, in Washington state, folks were 
saying it’s too late for containment. What 
surprised me is just how quickly we gave up 
on the standard shoe-leather-epidemiology 
approach to fighting epidemics that has 
been in place for hundreds of years. 


What did you do? 

| reached Charlie Baker, the governor of 
Massachusetts. And he made the decision 
to take a chance on a contact-tracing 
programme for the entire state. We're 
going to hire 1,000 people right away, and 
we'll have to spend more on isolation and 
quarantine. He had to make the decision to 
dip into the state’s funding to do this. 


Do you have any tips for how scientists can 
have their voices heard by policymakers? 

| think it’s very hard to convince others to 
do things that are complicated and labour- 
intensive, unless you've done it yourself, or 
at least started doing it. 


Do you make a financial argument in your 
push to contain COVID-19? 

| spent seven years at the World Bank. | can 
tell you it makes no sense to keep putting 
trillions of dollars into a stimulus package 
and not put hundreds of billions into the 
one thing that will stop the financial crisis: 
getting the virus under control. 

We will probably get a vaccine, but we 
cannot stay in lockdown until that happens. 
The financial impact is just going to be so 
devastating that we have to figure out a 
different way. 


Interview by Amy Maxmen 
This interview has been edited for length 
and clarity. 
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Feature 


ROOTS OF MENTAL ILLNESS 


ILLUSTRATION BY PAWEE JONCA 


Researchers are beginning to untangle the common biology that links 
supposedly distinct psychiatric conditions. By Michael Marshall 


n2018, psychiatrist Oleguer Plana-Ripoll 
was wrestling with a puzzling fact about 
mental disorders. He knew that many 
individuals have multiple conditions — 
anxiety and depression, say, or schizo- 
phrenia and bipolar disorder. He wanted 
toknowhowcommonit was to have more 
than one diagnosis, so he got his hands on 
a database containing the medical details of 
around 5.9 million Danish citizens. 

He was taken aback by what he found. 
Every single mental disorder predisposed the 
patient to every other mental disorder — no 
matter how distinct the symptoms!. “We knew 


that comorbidity was important, but we didn’t 
expect to find associations for all pairs,’ says 
Plana-Ripoll, who is based at Aarhus University 
in Denmark. 

The study tackles a fundamental question 
that has bothered researchers for more than 
acentury. What are the roots of mental illness? 

In the hope of finding an answer, scientists 
have piled up an enormous amount of data 
over the past decade, through studies of genes, 
brain activity and neuroanatomy. They have 
found evidence that many of the same genes 
underlie seemingly distinct disorders, suchas 
schizophrenia and autism, and that changesin 
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the brain’s decision-making systems could be 
involved in many conditions. 

Researchers are also drastically rethink- 
ing theories of how our brains go wrong. The 
idea that mental illness can be classified into 
distinct, discrete categories such as ‘anxiety’ 
or ‘psychosis’ has been disproved to a large 
extent. Instead, disorders shade into each 
other, and there are no hard dividing lines — as 
Plana-Ripoll’s study so clearly demonstrated. 

Now, researchers are trying to understand 
the biology that underlies this spectrum of 
psychopathology. 

They have a few theories. Perhaps there 
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are several dimensions of mental illness — so, 
depending on how a person scores on each 
dimension, they might be more pronetosome 
disorders than to others. An alternative, more 
radical idea is that there is a single factor that 
makes people prone to mental illness in general: 
which disorder they developis then determined 
by other factors. Bothideas are being taken seri- 
ously, although the concept of multiple dimen- 
sions is more widely accepted by researchers. 

The details are still fuzzy, but most psychia- 
trists agree that one thingis clear: the oldsystem 
of categorizing mental disorders into neat boxes 
does not work. They are also hopeful that, inthe 
long run, replacing this framework with one that 
is grounded in biology will lead to new drugs 
and treatments. Researchers aim to reveal, for 
instance, the key genes, brain regions and neu- 
rological processes involved in psychopathol- 
ogy, and target them with therapies. Although 
it might take a while to get there, says Steven 
Hyman of the Broad Institute of MIT and Har- 
vard in Cambridge, Massachusetts, “Iam long- 
term optimistic ifthe field really does its work.” 


Asmorgasbord of disorders 


The most immediate challenge is working 
out how to diagnose people. Since the 1950s, 
psychiatrists have used an exhaustive volume 
called the Diagnostic and Statistical Manual 
of Mental Disorders, currently in its fifth edi- 
tion. It lists all the recognized disorders, from 
autism and obsessive-compulsive disorder to 
depression, anxiety and schizophrenia. Eachis 
defined by symptoms. The inherent assump- 
tionis that each disorder is distinct, and arises 
for different reasons. 

However, even before the DSM-5 was pub- 
lished in 2013, many researchers argued that 
this approach was flawed. “Any clinician could 
have told you that patients had not read the DSM 
and didn’t conform to the DSM,” says Hyman, 
who helped to draft the manual’s fifth edition. 

Few patients fit into each neat set of criteria. 
Instead, people often havea mix of symptoms 
from different disorders. Evenif someone has 
a fairly clear diagnosis of depression, they 
often have symptoms of another disorder 
such as anxiety. “If you have one disorder, 
you’re much more likely to have another,” says 
Ted Satterthwaite, a neuropsychiatrist at the 
University of Pennsylvania in Philadelphia. 

This implies that the way clinicians have 
partitioned mental disorders is wrong. Psy- 
chiatrists have tried to solve this by splitting 
disorders into ever-finer subtypes. “If you look 
at the way the DSM has evolved over time, the 
book gets thicker and thicker,” says Satterth- 
waite. But the problem persists — the subtypes 
are still a poor reflection of the clusters of 
symptoms that many patients have. 

As a result, the world’s largest funder of 
mental-health science, the US National Insti- 
tute of Mental Health, changed the way it 
funded research. Beginning in 2011, it began 
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MENTAL MAP 


Similar genetic variants seem to underlie a number of psychiatric disorders. In one study of 200,000 
people®, schizophrenia was significantly correlated with most other disorders. By contrast, some disorders 
such as post-traumatic stress disorder (PTSD) showed only weak correlations to other conditions. 
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demanding more studies of the biological 
basis of disorders, instead of their symp- 
toms, under a programme called the Research 
Domain Criteria. There has since been an 
explosion of research into the biological basis 
of psychopathology, with studies focusing on 
genetics and neuroanatomy, among other 
fields. But if researchers hoped to demystify 
psychopathology, they still have along way to 
go: the key finding has been just how complex 
psychopathology really is. 


Controversial clusters 


Clinically, the evidence that symptoms cut 
across disorders — or that people frequently 
have more than one disorder — has only grown 
stronger. For this reason, although individual 
symptoms suchas mood alterations or impair- 
ments in reasoning can be diagnosed reliably, 
assigning patients to an overall diagnosis such 
as ‘bipolar disorder is difficult. 

Even seemingly separate disorders are linked. 
In 2008, geneticist Angelica Ronald, then at 
King’s College London Institute of Psychia- 
try, and her colleagues found that autism and 
attention deficit hyperactivity disorder (ADHD) 
overlapped. “At the time, you weren't allowed 
to be diagnosed with both conditions,” says 
Ronald; this was because of a rule in an earlier 
version of the DSM. But she and her team found 
that traits for autism and ADHD were strongly 
correlated, and partially under genetic control’. 

Furthermore, there seem to be clusters of 
symptoms that cross the boundaries of disor- 
ders. A 2018 study? examined people who had 
been diagnosed with either major depression, 
panic disorder or post-traumatic stress disorder 
(PTSD). The volunteers were assessed on the 
basis of their symptoms, cognitive performance 
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and brain activity. The researchers found that 
the participants fell into six groups, character- 
ized by distinct moods such as ‘tension’ and 
‘melancholia’. The groups cut across the three 
diagnostic categories as if they were not there. 
Many now agree that the diagnostic 
categories are wrong. The question is, with 
biology as their guide, what should psychiatric 
diagnosis and treatment look like instead? 


Multiple dimensions 


One prominent model is that there are anumber 
of neuropsychological traits or ‘dimensions’ 
that vary in every person. Each trait determines 
our susceptibility to certain kinds of disorder. 
For example, someone might be prone to mood 
disorders such as anxiety, but not to thought 
disorders suchas schizophrenia. 

This is similar to the way psychologists think 
about personality. In one model, five person- 
ality traits, such as conscientiousness and 
neuroticism, describe most of the variation 
inhuman personalities. 

Some psychiatrists are already trying to 
reimagine their discipline with dimensions 
in mind. In the early 2010s, there was a push 
to eliminate disorder categories from the 
DSM-S in favour of a ‘dimensional’ approach 
based on individual symptoms. However, this 
attempt failed — partly because health-care 
funding and patient care has been built up 
around the DSM's categories. However, other 
catalogues of disorders have shifted towards 
dimensionality. In 2019, the World Health 
Assembly endorsed the latest International 
Classification of Diseases (called ICD-11), in 
which some psychopathologies were newly 
broken down using dimensional symptoms 
rather than categories. 


SOURCE: REF. 8 


The challenge for the dimensionality 
hypothesis is obvious: how many dimensions 
are there, and what are they? Satterthwaite calls 
this “avery large problem’. 

One popular theory*, supported by many 
studies over the past decade, argues for just two 
dimensions. The first includes all ‘internalizing’ 
disorders, suchas depression, in which the pri- 
mary symptoms affect a person’s internal state. 
This is contrasted with ‘externalizing’ disorders, 
suchas hyperactivity and antisocial behaviour, 
in which a person’s response to the world is 
affected. If someone has been diagnosed with 
two or more disorders, studies suggest these 
are likely to be from the same category. 

But studies combining large amounts of 
brain-imaging data with machine learning have 
turned up different numbers — even in studies 
done by the same lab. Last year, Satterthwaite 
andhis group publisheda study’ of 1,141 young 
people who had internalizing symptoms, and 
found they could be split into two groups on 
the basis of their brain structure and function. 
In 2018, Satterthwaite led a similar study° and 
identified four dimensions, each associated 
with a distinct pattern of brain connectivity. 

Ultimately, a future version of the DSM could 
have chapters devoted to each dimension, says 
Hyman. These could list the disorders that clus- 
ter within each, as well as their symptoms and 
any biomarkers derived from the underlying 
physiology and genetics. Two people who had 
similar symptoms but different sets of muta- 
tions or neuroanatomical alterations could 
then be diagnosed and treated differently. 


Inthe genes 


One pillar of this future approach is a 
better understanding of the genetics of 
mental illness. In the past decade, studies of 
psychopathological genetics have become 
large enough to draw robust conclusions. 

The studies reveal that no individual 
gene contributes much to the risk of a 
psychopathology; instead, hundreds of 
genes each havea small effect. A 2009 study’ 
found that thousands of gene variants were 
risk factors for schizophrenia. Many were also 
associated with bipolar disorder, suggesting 
that some genes contribute to both disorders. 

This is not to say that the same genes are 
involved in all brain disorders: far from it. 
A team led by geneticist Benjamin Neale at 
Massachusetts General Hospital in Boston and 
psychiatrist Aiden Corvin at Trinity College 
Dublin found in 2018 that neurological disor- 
ders suchas epilepsy and multiple sclerosis are 
genetically distinct from psychiatric disorders 
such as schizophrenia and depression® (see 
‘Mental map’). 

These studies all looked at common 
variants, which are the easiest to detect. Some 
recent studies focused instead on extremely 
rare variants, which do suggest genetic dif- 
ferences between disorders. A study of more 


than 12,000 people’ found that individuals 
with schizophrenia had an unusually high rate 
of ultra-rare mutations — and that these were 
often unique to one individual. 

The result is a mess. It is difficult to predict 
which risk factors cut across conditions. 
“Some of them are quite broadly shared across 
psychopathology,’ says Neale, “whereas some 
are a bit more specific to one or a handful of 
forms of psychopathology.” 


The p factor 


Some psychiatrists have put forward a radical 
hypothesis that they hope will allow them to 
make sense of the chaos. If disorders share 
symptoms, or co-occur, and if many genes are 
implicated in multiple disorders, then maybe 
there is asingle factor that predisposes people 
to psychopathology. 


“If you have one disorder, 
you're much more likely to 
have another.’ 


The idea was first proposed in 2012 by 
public-health specialist Benjamin Lahey at the 
University of Chicago in Illinois’. Lahey and his 
colleagues studied symptoms in 11 disorders. 
They used statistics to examine whether the pat- 
tern could best be explained by three distinct 
dimensions, or by those three together with 
a ‘general’ predisposition. The model worked 
better if the general factor was included. 

The following year, the hypothesis received 
more support — and a catchy name — from 
husband-and-wife psychologists Avshalom 
Caspi and Terrie Moffitt at Duke University in 
Durham, North Carolina. They used data from 
along-term study of 1,037 people and found 
that most of the variation in symptoms could be 
explained by a single factor". Caspiand Moffitt 
called this the ‘p factor’. Since 2013, multiple 
studies have replicated their core finding. 

Caspiand Moffitt were clear that the p factor 
could not explain everything, and made no 
guesses about its underlying biology, specu- 
lating only that a set of genes might mediate 
it. Others have proposed that the p factor isa 
general predisposition to psychopathology, 
but that other factors — stressful experiences, 
or other gene alterations — nudge a person 
towards different symptoms”. But ifit is real, it 
has astartling implication: there could beasin- 
gle therapeutic target for psychiatric disorders. 

There are already hints that generalized 
treatments could work just as well as targeted 
therapies. A 2017 study” randomly assigned 
people with anxiety disorders, such as panic 
disorder or obsessive-compulsive disorder, to 
receive either a therapy for their specific disor- 
der or ageneralized approach. Both therapies 
worked equally well. 

Finding a physiological basis for the p factor 
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would be the first step towards therapies based 
on it, but only in the past few years have 
researchers found hints of it in genetic and 
neuroanatomical data. One study“ of the genet- 
ics of psychopathology ina UK population, for 
instance, identified a ‘genetic p factor’ — aset 
of genes in which there were variations that 
contributed to the risk of psychopathology. 

Meanwhile, other groups have searched 
for aneuroanatomical change that occurs in 
multiple psychopathologies. The results are 
intriguing, but contradictory. 

One study” of six psychopathologies found 
that the brain’s grey matter shrank in three 
regions involved in processing emotions: the 
dorsal anterior cingulate, right insula and left 
insula. But subsequent studies by Adrienne 
Romer, a clinical psychologist now at Har- 
vard Medical School and McLean Hospital in 
Belmont, Massachusetts, identified a totally 
different trio of regions with roles that include 
managing basic bodily functions and move- 
ment!¢ — the pons, cerebellum and part of the 
cortex. One key to making sense of this might 
be to focus on the brain’s executive function: 
the ability to regulate behaviour by planning, 
paying attention and resisting temptation, 
whichrelies on many brain regions. Romer and 
Satterthwaite have independently found dis- 
ruptions in executive function ina range of psy- 
chopathologies’”"’ — the suspicion being that 
these disruptions could underlie the p factor. 

Most scientists agree that what is needed 
is more data, and many remain unconvinced 
by such simple explanations. “I’m a little less 
certain that that’s how it’s going to play out,’ 
says Neale. At the genetic level at least, he 
says, there are many disorders, such as PTSD 
and generalized anxiety disorder, that remain 
poorly understood. 

All such sweeping hypotheses are prema- 
ture, says Hyman. “I think it’s a time for much 
more empirical research rather than grand 
theorization.” 


Michael Marshall is a freelance writer based 
in Devon, UK. 
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PROFILE OF A 


Scientists are quickly piecing together how the new 
coronavirus operates, where it came from and what it might 
do next — but pressing questions remain. By David Cyranoski 


n 1912, German veterinarians puzzled 
over the case of a feverish cat with an 
enormously swollen belly. That is now 
thought to be the first reported example 
of the debilitating power of a coronavi- 
rus. Veterinarians didn’t know it at the 
time, but coronaviruses were also giving 
chickens bronchitis, and pigs anintestinal 
disease that killed almost every piglet under 
two weeks old. 

The link between these pathogens remained 
hidden until the 1960s, when researchers inthe 
United Kingdom and the United States isolated 
two viruses with crown-like structures caus- 
ing commoncoldsin humans. Scientists soon 
noticed that the viruses identified in sick ani- 
mals had the same bristly structure, studded 
with spiky protein protrusions. Under electron 
microscopes, these viruses resembled the solar 
corona, which led researchers in 1968 to coin 
the term coronaviruses for the entire group. 

It was a family of dynamic killers: dog 
coronaviruses could harm cats, the cat coro- 
navirus could ravage pig intestines. Research- 
ers thought that coronaviruses caused only 
mild symptoms in humans, until the outbreak 
of severe acute respiratory syndrome (SARS) 
in 2003 revealed how easily these versatile 
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viruses could kill people. 

Now, as the death toll from the COVID-19 
pandemic surges, researchers are scram- 
bling to uncover as much as possible about 
the biology of the latest coronavirus, named 
SARS-CoV-2. A profile of the killer is already 
emerging. Scientists are learning that the 
virus has evolved an array of adaptations 
that make it much more lethal than the other 
coronaviruses humanity has met so far. Unlike 
close relatives, SARS-CoV-2 can readily attack 
human cells at multiple points, with the lungs 
and the throat being the main targets. Once 
inside the body, the virus makes use of a 
diverse arsenal of dangerous molecules. And 
genetic evidence suggests that it has been hid- 
ing out in nature possibly for decades. 


Bad family 


But there are many crucial unknowns about 
this virus, including how exactly it kills, 
whether it will evolve into something more — 
or less — lethal and whatit can reveal about the 
next outbreak from the coronavirus family. 

“There will be more, either out there already 
or inthe making,” says Andrew Rambaut, who 
studies viral evolution at the University of 
Edinburgh, UK. 
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Of the viruses that attack humans, 
coronaviruses are big. At 125 nanometres in 
diameter, they are also relatively large for the 
viruses that use RNA to replicate, the group 
that accounts for most newly emerging dis- 
eases. But coronaviruses really stand out for 
their genomes. With 30,000 genetic bases, 
coronaviruses have the largest genomes of 
all RNA viruses. Their genomes are more than 
three times as big as those of HIV and hepatitis 
C, and more than twice influenza’s. 

Coronaviruses are also one of the few 
RNA viruses with a genomic proofreading 
mechanism — which keeps the virus from 
accumulating mutations that could weaken 
it. That ability might be why common anti- 
virals such as ribavirin, which can thwart 
viruses suchas hepatitis C, have failed to sub- 
due SARS-CoV-2. The drugs weaken viruses by 
inducing mutations. But inthe coronaviruses, 
the proofreader can weed out those changes. 

Mutations can have their advantages for 
viruses. Influenza mutates up to three times 
more often than coronaviruses do, a pace that 
enables it to evolve quickly and sidestep vac- 
cines. But coronaviruses have a special trick 
that gives them a deadly dynamism: they fre- 
quently recombine, swapping chunks of their 
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RNA with other coronaviruses. Typically, this 
isa meaningless trading of like parts between 
like viruses. But when two distant coronavirus 
relatives end up in the same cell, recombina- 
tion can lead to formidable versions that infect 
new cell types and jump to other species, says 
Rambaut. 

Recombination happens often in bats, 
which carry 61 viruses known to infect humans; 
some species harbour as many as 12 (ref. 1). In 
most cases, the viruses don’t harm the bats, 
and there are several theories about why bats’ 
immune systems can cope with these invaders. 
Apaper published in February argues that bat 
cells infected by viruses rapidly release a signal 
that makes them able to host the virus without 
killing it?. 

Estimates for the birth of the first corona- 
virus vary widely, from 10,000 years ago to 
300 million years ago. Scientists are now aware 
of dozens of strains’, seven of which infect 
humans. Among the four that cause com- 
mon colds, two (OC43 and HKU1) came from 
rodents, and the other two (229E and NL63) 
from bats. The three that cause severe disease 
— SARS-CoV (the cause of SARS), Middle East 
respiratory syndrome MERS-CoV and SARS- 
CoV-2 — allcame from bats. But scientists think 
there is usually an intermediary — an animal 
infected by the bats that carries the virus 
into humans. With SARS, the intermediary 
is thought to be civet cats, which are sold in 
live-animal markets in China. 

The origin of SARS-CoV-2 is still an open ques- 
tion. The virus shares 96% of its genetic material 
witha virus found ina bat ina cave in Yunnan, 
China‘ — a convincing argument that it came 


from bats, say researchers. But there’s a crucial 
difference. The spike proteins of coronaviruses 
have a unit called a receptor-binding domain, 
which is central to their success in entering 
human cells. The SARS-CoV-2 binding domain is 
particularly efficient, and it differs inimportant 
ways from that of the Yunnan bat virus, which 
seems not to infect people?. 

Complicating matters, a scaly anteater 
called the pangolin showed up with a coro- 
navirus that had a receptor-binding domain 
almost identical to the human version. But the 
rest of the coronavirus was only 90% geneti- 
cally similar, so some researchers suspect the 
pangolin was not the intermediary*. The fact 
that both mutations and recombinations are at 
work complicates efforts to drawa family tree. 

But studies released over the past few 
months, which have yet to be peer-reviewed, 
suggest that SARS-CoV-2 — or a very similar 
ancestor — has been hiding insome animal for 
decades. According to a paper posted online 
in March‘, the coronavirus lineage leading to 
SARS-CoV-2 split more than 140 years ago from 
the closely related one seen today in pangolins. 
Then, sometime in the past 40-70 years, the 
ancestors of SARS-CoV-2 separated from the bat 
version, which subsequently lost the effective 
receptor binding domain that was present in its 
ancestors (and remains in SARS-CoV-2). A study 
published on 21 April came up with very similar 
findings using a different dating method’. 

These results suggest along family history, 
with many coronavirus branches in bats and 
possibly pangolins carrying the same deadly 
receptor binding domain as SARS-CoV-2, 
including some that might have similar 


abilities to cause a pandemic, says Rasmus 
Nielsen, an evolutionary biologist at the Uni- 
versity of California, Berkeley, and co-author 
of the second study. “There is a need for con- 
tinued surveillance and increased vigilance 
towards the emergence of new viral strains by 
zoonotic transfer,” he says. 


Two open doors 


Although the known human coronaviruses 
can infect many cell types, they all mainly 
cause respiratory infections. The difference 
is that the four that cause common colds eas- 
ily attack the upper respiratory tract, whereas 
MERS-CoV and SARS-CoV have more difficulty 
gaining ahold there, but are more successful 
at infecting cells in the lungs. 

SARS-CoV-2, unfortunately, can do both 
very efficiently. That gives it two places to get 
a foothold, says Shu-Yuan Xiao, a pathologist 
at the University of Chicago, Illinois. A neigh- 
bour’s cough that sends ten viral particles your 
way might be enough to start an infection in 
your throat, but the hair-like cilia found there 
are likely to do their job and clear the invaders. 
Ifthe neighbour is closer and coughs 100 par- 
ticles towards you, the virus might be able get 
all the way down to the lungs, says Xiao. 

These varying capacities might explain why 
people with COVID-19 have such different 
experiences. The virus can start in the throat 
or nose, producing a cough and disrupting 
taste and smell, and then end there. Orit might 
work its way downto the lungs and debilitate 
that organ. How it gets down there, whether 
it moves cell by cell or somehow gets washed 
down, is not known, says Stanley Perlman, an 
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Research suggests the 
SARS-CoV-2 virus has an array 
of adaptations that help it 
break into human cells — the 
first step in causing COVID-19 
disease. Scientists are still 
debating many of the details. 


Cell membrane 


1. The spike proteins 
that stud the exterior of 
the virus have receptor 
binding domains that 
are extremely efficient 
at latching onto ACE2 
receptors on human 
cells. 
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3. That exposes fusion Snipped spike protein 
peptides — small 
chains of amino acids 
— that fuse the viral 
membrane with the 
membrane of the host 


cell. 


membrane 
Fusion 
peptides 


immunologist at the University of lowainlowa 
City who studies coronaviruses. 

Clemens-Martin Wendtner, an infec- 
tious-disease physician at the Munich Clinic 
Schwabing in Germany, says it could bea 
problem withthe immune system that lets the 
virus sneak down into the lungs. Most infected 
people create neutralizing antibodies that are 
tailored by the immune system to bind with 
the virus and block it from entering a cell. But 
some people seem unable to make them, says 
Wendtner. That might be why some recover 
after a week of mild symptoms, whereas others 
get hit with late-onset lung disease. But the 
virus can also bypass the throat cells and go 
straight down into the lungs. Then patients 
might get pneumonia without the usual mild 
symptoms such as a cough or low-grade 
fever that would otherwise come first, says 
Wendtner. Having these two infection points 
means that SARS-CoV-2 can mix the transmissi- 
bility of the common cold coronaviruses with 
the lethality of MERS-CoV and SARS-CoV. “It is 
an unfortunate and dangerous combination of 
this coronavirus strain,” he says. 

The virus’s ability to infect and actively 
reproduce in the upper respiratory tract was 
something of a surprise, given that its close 
genetic relative, SARS-CoV, lacks that ability. 
Last month, Wendtner published results® of 
experiments in which his team was able to 
culture virus from the throats of nine peo- 
ple with COVID-19, showing that the virus is 
actively reproducing and infectious there. 
That explains a crucial difference between the 
close relatives. SARS-CoV-2 can shed viral par- 
ticles from the throat into saliva even before 
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symptoms start, and these can then pass easily 
from person to person. SARS-CoV was much 
less effective at making that jump, passing 
only when symptoms were full-blown, mak- 
ing it easier to contain. 

These differences have led to some confu- 
sion about the lethality of SARS-CoV-2. Some 
experts and media reports describe it as less 
deadly than SARS-CoV because it kills about 
1% of the people it infects, whereas SARS-CoV 
killed at roughly ten times that rate. But Per- 
Iman says that’s the wrong way to look at it. 
SARS-CoV-2 is much better at infecting peo- 
ple, but many of the infections don’t progress 
to the lungs. “Once it gets down in the lungs, 
it’s probably just as deadly,” he says. 

Whatit does when it gets downto the lungs 
is similar in some respects to what respiratory 
viruses do, although much remains unknown. 
Like SARS-CoV and influenza, it infects and 
destroys the alveoli, the tiny sacs in the lungs 
that shuttle oxygen into the bloodstream. 
As the cellular barrier dividing these sacs 
from blood vessels break down, liquid from 


the vessels leaks in, blocking oxygen from 
getting to the blood. Other cells, including 
white blood cells, plug up the airway further. 
A robust immune response will clear all this 
out in some patients, but overreaction of the 
immune system can make the tissue damage 
worse. If the inflammation and tissue dam- 
age are too severe, the lungs never recover and 
the person dies or is left with scarred lungs, 
says Xiao. “From a pathological point of view, 
we don’t see a lot of uniqueness here.” 

And as with SARS-CoV, MERS-CoV and ani- 
mal coronaviruses, the damage doesn’t stop 
with the lungs. A SARS-CoV-2 infection can 
trigger an excessive immune response known 
as acytokine storm, which can lead to mul- 
tiple organ failure and death. The virus can 
also infect the intestines, the heart, the blood, 
sperm (as can MERS-CoV), the eye and pos- 
sibly the brain. Damage to the kidney, liver 
and spleen observed in people with COVID-19 
suggests that the virus can be carried in the 
blood and infect various organs or tissues, 
says Guan Wei-jie, a pulmonologist at the 
Guangzhou Institute of Respiratory Health 
at Guangzhou Medical University, China, an 
institution lauded for its role in combating 
SARS and COVID-19. The virus might be able 
to infect various organs or tissues wherever 
the blood supply reaches, says Guan. 

But although genetic material from the 
virus is showing up in these various tissues, 
it is not yet clear whether the damage thereis 
being done by the virus or by acytokine storm, 
says Wendtner. “Autopsies are under way in 
our centre. More data will come soon,” he says. 

Whether it infects the throat or the lungs, 


5. The foreign RNA hijacks the host’s 
cellular machinery to produce RNA and 
proteins that get assembled into new 
virus particles. 
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6. As the virus particles exit the cell, 
furin might act on the spike protein to 
prime it. The new particles can attack 
other cells or leave the body and infect 
other people. 
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SARS-Cov-2 breaches the protective mem- 
brane of host cells using its spike proteins (see 
‘Deadly invader’). First, the protein’s recep- 
tor-binding domain latches on to a receptor 
called ACE2, which sits on the surface of 
the host cell. ACE2 is expressed throughout 
the body on the lining of the arteries and veins 
that course through all organs, but it is par- 
ticularly dense on the cells lining the alveoli 
and small intestines. 

Although the exact mechanisms remain 
unknown, evidence suggests that after 
the virus attaches itself, the host cell snips the 
spike protein at one of its dedicated ‘cleavage 
sites’, exposing fusion peptides — small chains 
of amino acids that help to pry open the host 
cell’s membrane so that the virus’s membrane 
can merge with it. Once the invader’s genetic 
material gets inside the cell, the virus com- 
mandeers the host’s molecular machinery 
to produce new viral particles. Then, those 
progeny exit the cell to go and infect others. 


Power spikes 


SARS-CoV-2 is uniquely equipped for 
forcing entry into cells. Both SARS-CoV and 
SARS-CoV-2 bind with ACE2, but the recep- 
tor-binding domain of SARS-CoV-2 is a par- 
ticularly snug fit. It is 10-20 times more likely 
to bind ACE2 than is SARS-CoV’. Wendtner 
says that SARS-CoV-2 is so good at infecting 
the upper respiratory tract that there might 
even be asecond receptor that the virus could 
use to launch its attack. 

Even more troubling is the fact that 
SARS-COV-2 seems to make use of the enzyme 
furin from the host to cleave the viral spike pro- 
tein. This is worrying, researchers say, because 
furin is abundant in the respiratory tract and 
found throughout the body. It is used by other 
formidable viruses, including HIV, influenza, 
dengue and Ebola to enter cells. By contrast, 
the cleavage molecules used by SARS-CoV are 
much less common and notas effective. 

Scientists think that the involvement of 
furin could explain why SARS-CoV-2 is so 
good at jumping from cell to cell, personto 
person and possibly animal to human. Rob- 
ert Garry, a virologist at Tulane University in 
New Orleans, Louisiana, estimates that it gives 
SARS-CoV-2a100-1,000 times greater chance 
than SARS-CoV of getting deep into the lungs. 
“When I saw SARS-CoV-2 had that cleavage site, 
I did not sleep very well that night,” he says. 

The mystery is where the genetic instruc- 
tions for this particular cleavage site came 
from. Although the virus probably gained 
them through recombination, this particu- 
lar set-up has never been found in any other 
coronavirus in any species. Pinning down its 
origin might be the last piece in the puzzle that 
will determine which animal was the stepping 
stone that allowed the virus to reach humans. 

Some researchers hope that the virus 
will weaken over time through a series of 
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mutations that adapt it to persist in humans. 
By this logic, it would become less deadly and 
have more chances to spread. But researchers 
have not yet found any sign of such weaken- 
ing, probably because of the virus’s efficient 
genetic repair mechanism. “The genome of 
COVID-19 virus is very stable, and I don’t see 
any change of pathogenicity that is caused 
by virus mutation,” says Guo Deyin, who 
researches coronaviruses at Sun Yat-sen 
University in Guangzhou. 

Rambaut, too, doubts that the virus will 
become milder over time and spare its host. 
“It doesn’t work that way,” he says. As long as 
it can successfully infect new cells, reproduce 
and transmit to new ones, it doesn’t matter 
whether it harms the host, he says. 


y 


BY FAR THE MOST 
LIKELY SCENARIOS 
THAT THE VIRUS WILL 
CONTINUE TO SPREAD. 


But others think there is a chance for a better 
outcome. It might give people antibodies that 
will offer at least partial protection, says Klaus 
Stohr, who headed the World Health Organ- 
ization’s SARS research and epidemiology 
division. Stohr says that immunity will not 
be perfect — people who are reinfected will 
still develop minor symptoms, the way they 
do now from the common cold, and there 
will be rare examples of severe disease. But 
the virus’s proofreading mechanism means it 
will not mutate quickly, and people who were 
infected will retain robust protection, he says. 

“By far the most likely scenario is that the 
virus will continue to spread and infect most 
of the world population in a relatively short 
period of time,” says Stohr, meaning one to 
two years. “Afterwards, the virus will con- 
tinue to spread in the human population, 
likely forever.” Like the four generally mild 
human coronaviruses, SARS-CoV-2 would 
then circulate constantly and cause mainly 
mild upper respiratory tract infections, says 
Stohr. For that reason, he adds, vaccines won't 
be necessary. 

Some previous studies support this 
argument. One!’ showed that when people 
were inoculated with the common-cold coro- 
navirus 229E, their antibody levels peaked two 
weeks later and were only slightly raised after 
a year. That did not prevent infections a year 
later, but subsequent infections led to few, if 
any, symptoms and a shorter period of viral 
shedding. 

The OC43 coronavirus offers a model for 
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where this pandemic might go. That virus 
also gives humans common colds, but genetic 
research from the University of Leuven in 
Belgium suggests that OC43 might have been 
akiller in the past”. That study indicates that 
OC43 spilled over to humans in around 1890 
from cows, which got it from mice. The scien- 
tists suggest that OC43 was responsible for a 
pandemic that killed more than one million 
people worldwide in 1889-90 — an outbreak 
previously blamed on influenza. Today, OC43 
continues to circulate widely and it might be 
that continual exposure to the virus keeps the 
great majority of people immune to it. 


End game 


But even if that process made OC43 less 
deadly, it is not yet clear whether something 
similar would happen with SARS-CoV-2. A 
study in monkeys showed that they retained 
antibodies to SARS-CoV-2, but the researchers 
only reported on the first 28 days after infec- 
tion, so it is unclear how long the immunity 
lasted”. Concentrations of antibodies against 
SARS-CoV also dropped significantly over a 
two- to three-year period”. Whether those low- 
ered levels would be enough to prevent infec- 
tion or reduce severity has not been tested. 
Cats, cows, dogs and chickens do not seem 
to become immune to the sometimes deadly 
coronaviruses that infect them, leaving vet- 
erinarians over the years to scramble for vac- 
cines. Despite all the questions about whether 
people retain any immunity to SARS-CoV-2, 
some countries are promoting the idea of giv- 
ing survivors ‘immunity passports’ to allow 
them to venture out without fear of being 
infected or infecting others. 

Many scientists are reserving judgement on 
whether the tamer coronaviruses were once 
as virulent as SARS-CoV-2. People like to think 
that “the other coronaviruses were terrible and 
became mild”, says Perlman. “That’s an opti- 
mistic way to think about what’s going on now, 
but we don’t have evidence.” 


David Cyranoski reports for Nature from 
Shanghai, China. 
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Science in culture 
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In the seventeenth century, Galileo faced persecution for his heretical views on astronomy. 


Galileo’s story is 
always relevant 


With science denialism stronger than ever, 
who better to revisit? By Alison Abbott 


s there room in the crowded canon fora 

new biography of Galileo Galilei? Astro- 

physicist Mario Livio is betting so. His 

Galileo and the Science Deniers aims to 

stand out by placing the original Renais- 
sance man and his discoveries in modern sci- 
entific and social contexts. In particular, he 
argues, the charges of heresy that Galileo faced 
for his scientific claims in the seventeenth 
century have their counterparts in science 
deniers’ condemnations today. 

Born in 1564 in Pisa, Italy, into an intel- 
lectual family of declining fortune, Galileo 
pursued medicine at the University of Pisa. 
But he soon abandoned his course to study 
mathematics, his enduring passion. The 


Universe, he famously wrote, “is written in 
the language of mathematics”. It was an argot 
that allowed him to break reliance on the 
Aristotelian cosmology prized by the Catholic 
Church, and to forge anew, quantitative study 
of nature. 


~ Galileo and the 

\ Science Deniers 
Mario Livio 

Simon & Schuster 
(2020) 
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In Pisa, Galileo delved into mechanics, 
using his observations to question accepted 
ideas about motion (although Livio reminds 
us that Galileo probably never conducted the 
famous experiment in which he supposedly 
dropped spheres from the city’s leaning tower 
and found that they fell at the same speed 
regardless of their mass). 

In1592, he moved to the University of Padua 
in Italy, an intellectually liberal environment 
happily beyond the jurisdiction of the Pope. 
Here, he began to discuss the revolutionary 
theory, proposed by Polish mathematician 
Nicolaus Copernicus in 1543, that rather than 
being the Universe’s fixed centre as Aristotle 
had insisted, Earth was in fact orbiting the Sun. 

Livio structures his account partly around 
specific works, including Galileo’s 1610 The 
Sidereal Messenger, which described his major 
astronomical observations. While working 
in Padua, Galileo often visited the nearby 
port of Venice, where he was introduced 
to the ‘spyglass’, a new-fangled instrument 
from Holland that could be used to see ships 
approach. Galileo turned it to the heav- 
ens to make the discoveries that changed 
the course of astronomy, and launched his 
own fate. 

Inhis first observations, Galileo saw that the 
Moonwas notasmooth sphere, but was moun- 
tainous. This contradicted the church’s view 
that the heavens were pristine and unchangea- 
ble, unlike the corrupt, mutable Earth. He also 
saw Satellites orbiting Jupiter, shooting ahole 
inthe geocentric argument that if Earth were 
to move, it would lose its Moon. 

In 1610, against his friends’ advice and in 
pursuit of more money, Galileo left the pro- 
tective environment of Padua and moved to 
Florence to work for Cosimo II de’ Medici, 
the Grand Duke of Tuscany. Despite now liv- 
ing within the Pope’s sphere of influence, in 
1632, Galileo published his book Dialogue 
Concerning the Two Chief World Systems, an 
imaginary debate between Salviati, an advo- 
cate of heliocentrism, anda witless geocentrist 
named Simplicio. 

Putting the Pope’s view — that God’s universe 
is inherently unknowable — into the mouth 
of a fool was risky. Galileo’s fame and fortune 
rose, but so did the power and determina- 
tion of his enemies, and the Holy Inquisition 
finally claimed him. Ina riveting account of 
the trial, Livio describes how the Inquisition 
dismissed Galileo’s claim that the Dialogue 
was a balanced argument that ultimately 
rejected Copernicanism. On 22 June 1633, 
one of the world’s most venerated scien- 
tists was on his knees before its members, 
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renouncing the errors and heresies inherent 
to Copernicanism. 

Livio parses the considerable, and often 
ambiguous, evidence about Galileo’s life 
and trial, and comments on the conclusions 
reached by various historians. The official 
summary of the trial proceedings, he writes, 
“revealed a clear intention to present Galileo 
inthe worst possible light”. Like others before 
him, Livio doubts claims that Galileo left the 
court defiantly muttering about Earth, “and 
yet it moves”. 

The non-chronological zigzagging of the 
book can be hard to follow, but allows Livio to 
focus onthemes, suchas Galileo’s polymathy. 
He highlights Galileo’s lifelong study of the 
great Italian poets Dante Alighieri, Torquato 
Tasso and Ludovico Ariosto. And he notes that 
the astronomer’s drawing skills and knowledge 
of perspective allowed him to understand that 
the shadings on the Moon were shadows cast 
by mountains, and to depict them in lovely 
watercolours. 

Livio is at his best when he discusses how 
Galileo’s scientific understanding compares 


“Critics from creationists 
to Donald Trump discredit 
arguments by exploiting 
gapsin knowledge.” 


with that of researchers today. Galileo sug- 
gested, for example, that comets might be 
optical phenomena caused by the reflection 
of sunlight by vapours released from Earth. 
We now know they are ‘dirty snowballs’ made 
of ice, rock, dust and frozen gases. Some of 
these components vaporize when they get 
close to the Sun, giving comets two tails: one 
of dust that reflects sunlight, and one of gas 
that glows as it ionizes. 

And what of today’s science deniers? Livio 
briefly addresses how religion and business 
interests still conspire to attack evidence 
for evolution and anthropogenic climate 
change. In general, “processes that are not 
fully understood don’t constitute flaws”, he 
points out, but critics from creationists to 
Donald Trump discredit scientific arguments 
by exploiting gaps in knowledge. It’s a 
chillingly relevant theme, yet the parallels he 
draws between Galileo’s trial and contem- 
porary science wars feel thin, and there’s 
a frustrating lack of examples to demon- 
strate the continuity of denialism through 
the centuries. 

Nonetheless, Livio has added to the canon 
an accessible and scientific narrative, in 
which a profound love for Galileo shines 
through. 


Alison Abbott is a writer in Munich, Germany. 
e-mail: abbott.alison@googlemail.com 
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Books in brief 


TOO SMART 


JATHAN SADOWSKI 


Too Smart 

Jathan Sadowski MIT Press (2020) 

‘Smart’ technology in hands, homes and cities will “measure, 

monitor, manage and monetize all aspects of our lives”, shows Jathan 
Sadowski. One toothbrush, for example, uses sensors to record 
when, how long and how well users brush their teeth, sending the 
information to cloud servers owned by the manufacturer or a third 
party. Users might choose to share such data with their dentists — but 
surely not with dental insurers without express permission. This hype- 
and jargon-free warning deserves a wide welcome. 


RIVERS OF 
Bes | 


HOW A NATURAL FORCE RAISED 
KINGDOMS, DESTROYED CIVILIZATIONS, 
AND SHAPES OUR WORLD 


Rivers of Power 

Laurence C. Smith Little, Brown Spark (2020) 

Many civilizations began beside great rivers: the Tigris and Euphrates in 
what’s now Iraq, the Nile in Egypt, the Indus in India and Pakistan, and 
the Yellow in China. Rivers run through almost all our great cities, notes 
Earth scientist Laurence Smith in his highly readable history, extolling 
their fundamental benefits: “access, natural capital, territory, well- 
being and a means of projecting power”. No wonder we still speak of 
‘crossing the Rubicon’, a border river traversed by Julius Caesar's army 
in defiance of republican law, triggering the rise of the Roman empire. 


OUR INAL 
WARNING 


SIX 
DEGREES 
OF 
CLIMATE 
EMERGENCY. 


MARK 
LYNAS 


Our Final Warning 

Mark Lynas Fourth Estate (2020) 

The average global temperature has risen 1°C above pre-industrial 
levels. This update to environmental activist Mark Lynas’s 2007 book 
Six Degrees explores the likely effects of further rises. Its horrifying 
eloquence derives from its restraint, grounded in research, including 
a 2019 Nature assessment of future carbon dioxide emissions from 
existing infrastructure, which leads Lynas to forecast total emissions 
double those required to keep increases to the 1.5°C agreed in Paris 
in 2015. “This really is our final warning,” he concludes. 


How Bad Policy and Bad Evidence 
Harm\People-with Cancer 


Malignant 

Vinayak K. Prasad Johns Hopkins Univ. Press (2020) 

The cost of new cancer treatments in the United States is routinely 
US$100,000 per year per person; some are more than $400,000 
per dose. Oncologist Vinayak Prasad’s insider study analyses how 

US scientific, industrial and regulatory policies — as well as financial 
conflicts of interest for physicians — “incentivize the pursuit of 
marginal or unproven therapies at lofty and unsustainable prices”. 
Aimed at general readers (including patients), oncology trainees and 
experts in health-care policy, it informs and disturbs throughout. 


Other Natures 


CLARA BOSAK-SCHROEDER 


Other Natures 

Clara Bosak-Schroeder Univ. California Press (2020) 

US natural-history museums haunt classicist Clara Bosak-Schroeder’s 
study of how ancient Greek historians Herodotus and Diodorus Siculus 
portrayed non-Greek peoples, such as Ethiopians and Persians. She 
begins with her childhood fascination with a diorama of a crouching 
African woman holding a pink grub to her mouth, and ends as she adopts 
a killer whale at Washington’s Whale Museum. Ancient ethnographies, 
she says, can help people “confront environmental degradation and 
transform their own relationships to other species”. Andrew Robinson 
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Obituary 


Philip W. Anderson 


(1923-2020) 


Nobel winner who transformed condensed-matter and particle physics. 


hilip Warren Anderson, who has died 
aged 96, led the development of con- 
densed-matter physics. In 1977, he won 
ashare of the Nobel Prize in Physics for 
his discovery of electron localization, 
whereby disordered metals become insula- 
tors, and for his pioneering work on mag- 
netism. His studies of superconductors led 
him to propose howthe force carriers between 
subatomic particles, such as photons, acquire 
mass: the Anderson-Higgs mechanism is now 
part of the standard model of particle physics. 

Quantum physics’ early triumphs persuaded 
some that advances would derive exclusively 
from reducing nature to its most fundamental 
particles. Anderson rejected this view, argu- 
ing that the emergent properties that develop 
when matter comes together are equally signif- 
icant. He reasoned that, in science, each new 
level of complexity requires new fields, con- 
necting physics, chemistry, biology, computer 
science and economics. 

Anderson was bornin 1923. Thesonofa plant 
pathologist, he grew upin Urbana-Champaign, 
Illinois, and developed an early passion for 
speed skating. At 16, he went to Harvard Uni- 
versity in Cambridge, Massachusetts, where 
he fell in love with physics. During the Second 
World War, he worked on radar at the US Naval 
Research Laboratory in Washington DC. He 
first learnt about quantum mechanics when 
a colleague gave him a precious text on it to 
repay a wartime loan. Returning to Harvard 
for graduate work, he studied the effects of 
pressure on the broadening of spectral lines, 
mentored by John Hasbrouck Van Vleck, one 
of those with whom he shared the Nobel prize. 

In1949, William Shockley, co-inventor of the 
transistor, hired Anderson to join the theory 
group at Bell Telephone Laboratories in Murray 
Hill, NewJersey. Here, Anderson first focused 
on magnetism. Quantum mechanics predicts 
that electrons carry tiny magnetic moments 
called spin. The attraction of fridge magnets 
results when spins orient in the same direc- 
tion. By contrast, the read heads of hard disk 
drives rely onan antiferromagnet, in which the 
spin alternates direction on adjacent atomic 
layers. By combining the effects of electron 
repulsion and quantum mechanics, Anderson 
explained how iron atoms become magnetic, 
and accounted for the interactions that result 
in antiferromagnetism. 

In 1958, Anderson discovered a remark- 
able parallel between magnetism and 


superconductivity. Whereas magnets con- 
centrate fields, superconductors expel them, 
allowing them to levitate. Following John 
Bardeen, Leon Cooper and Robert Schrieffer’s 
discovery that superconductivity results from 
electrons forming Cooper pairs, Anderson real- 
ized that these pairs are a kind of pseudospin. 
In a magnet, fluctuations in the magnetiza- 
tion propagate, forming a spin wave. Cooper 
pairs are charged, and pseudospin waves 
cause an electric current that interacts with 
the electromagnetic fields, which are carried 
by photons. When Anderson calculated the 
motion of photons inside a superconductor, 
he found that they acquired a mass. 

Anderson realized that there were still- 
deeper parallels between superconductivity 
and particle physics. In 1962, he proposed a 
mechanism for subatomic force carriers called 
gauge bosons to acquire mass ina kind of cos- 
mic superconductor, now knownas the Higgs 
field after the British physicist Peter Higgs. 
Anderson’s work was prominently cited in 
Higgs’s 1964 paper predicting the existence 
of the Higgs boson. 

Work on semiconductors at Bell Labs led 
Anderson to propose in 1957 that disorder — 
suchas that caused by defects and impuritiesin 
a material — localizes electron waves, produc- 
ing an insulator. Today, Anderson localization 
is recognized as a general property of all kinds 
of waves in disordered media. But the original 
idea was radical and took two decades and the 
contributions of many leading physicists to be 
developedin detail. Andersonalso studied spin 


© 2020 Springer Nature Limited. All rights reserved. 


glasses, atype of random magnet that contains 
both ferromagnetic and antiferromagnetic 
interactions. Working with Sam Edwards at the 
University of Cambridge, UK, he modelled the 
glass’s capacity to ‘remember’ the history of its 
environment. The Edwards—Anderson model 
was an early forerunner of the neural networks 
used in modern machine learning. 

Anderson had an artist’s eye for original 
interpretation, and worked closely with 
experimentalists to develop ideas that led to 
new fields of study. During a year in Japan in 
1953, visiting the mathematical physicist Ryvogo 
Kubo at the University of Tokyo, he became a 
master in the ancient game of Go. Just as he 
could instinctively see several moves beyond 
his opponent, he often reached an under- 
standing of physics that was hard for others 
to grasp. Unusually intuitive for a theoretical 
physicist, he was able to reduce complex prob- 
lems so they would succumb toa minimum of 
mathematics. 

To the many younger physicists he men- 
tored, with whom he generously shared 
inspiring ideas, he was something of a guru. 
He would invite them, and their families, for 
dinner, developing lasting friendships. He had 
apassion for the outdoors, spending Sundays 
gardening and clearing the woodland around 
his house. 

From 1967 to 1975, Anderson was a visiting 
professor at Cambridge, where he was a close 
associate of Nevill Mott, the third of the 1977 
Nobel prizewinners. In 1975, he took up a posi- 
tion at Princeton University in New Jersey, 
where he devoted much of his research to high- 
temperature superconductivity (discovered in 
1986 by Georg Bednorz and Alex Miller). 

Anderson’s resonating valence bond (RVB) 
theory, proposed in 1987, once again bor- 
rows ideas from magnetism, positing that 
high-temperature superconductivity results 
from the injection of charge into an insulating 
quantum spin liquid. High-temperature super- 
conductivity has still not been achieved, but 
many think that RVB theory contains the seeds 
of howit might be. Although Anderson has left 
us, his ideas are still many moves ahead. 


Piers Coleman is a distinguished professor 
at the Rutgers Center for Materials Theory, 
Rutgers, the State University of New Jersey. 
He was a graduate student with Philip W. 
Anderson at Princeton in 1980-1984. 
e-mail: coleman@physics.rutgers.edu 
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Readers respond 


Correspondence 


COVID-19: remote 
tech spares clinicians 


Virtual-reality technology 
backed by an ultrafast 5G 
mobile network (VR+5G) 

has complemented rigorous 
preventive measures in 

cutting the infection rate of 
medical staff on the front line 
of the COVID-19 epidemic ata 
university hospital in Zhejiang, 
China. This technology enables 
remote diagnosis and treatment 
monitoring of patients, thereby 
helping to reduce direct 
contact with medical staff (see 
go.nature.com/35ex2py). 

This VR system allows medical 
personnel to interact witha 
digital 3D representation of 
the patient’s environment 
through a computer-generated 
simulation in real time. Tracking 
data collected through such 
VR+5G applications could also 
provide valuable information 
for policymakers, for example 
on factors that influence the 
course, infectivity and severity 
of the illness. 

VR+5G would also improve the 
monitoring and communication 
capability of people at home. 


Guoen Cai Fujian Medical 
University Union Hospital, China. 


Guofa Cai Guangdong University 
of Technology, China. 
caiguofa2006@gdut.edu.cn 
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Arab students thrive 
in Israeli university 


For long-standing reasons, the 
Arab minority in Israel has been 
under-represented in Israeli 
universities. The Technion — 
Israel Institute of Technology 
has rectified this by developing 
an empowerment model that is 
now being extended to all other 
academic institutes in Israel 
through the Israeli Council for 
Higher Education. 

The Technion resolved two 
weaknesses, without resorting 
to US-style affirmative action 
to resolve discrimination. 
These were the inadequacy 
of Arab students’ secondary- 
school preparation for higher 
education, and the difficulty 
of adjusting to campus life in 
the face of cultural and age- 
related differences from the 
Jewish majority. The Technion 
set up coaching programmes 
for prospective students, 
workshops in core courses 
run by high-achieving Arabic- 
speaking students, personal 
tutoring in social engagement, 
and professional guidance on 
self-management. 

Since 2004, our numbers 
of Arabic-speaking students 
and Arab female students have 
increased by 200% and 350%, 
respectively. The number 
of Arab students deemed 
‘outstanding’ has risen by 
1,800%, and the number of Arab 
graduate students by 120%. 
Dropout rates among our Arab 
students have fallen by 67%. 


Hossam Haick, Peretz Lavie 
Technion — Israel Institute of 
Technology, Haifa, Israel. 
hhossam@technion.ac. il 


Fatina Abreek-Zubiedat Swiss 
Federal Institute of Technology 
(ETH), Zurich, Switzerland. 


Learn truths from 
replicability testing 


More incentive is needed to spur 
investigation into replication 
issues and null results (see 
Nature 578, 489-490; 2020). 
For example, experienced 
scientists could encourage 
junior researchers to allocate 
part of their time to verifying 
other researchers’ results, 
which would also provide them 
with essential insights into the 
scientific method. To support 
such ventures, we are launching 
ReScienceX, a free-to-publish 
and free-to-read, peer-reviewed 
journal that will be devoted to 
reproduction and replication 
experiments. 

Research students and 
early-career researchers 
should be given credit for 
publishing replication studies. 
This would have the advantage 
of also providing them with 
valuable research training in 
experimental design and the use 
of analytical tools. It is currently 
not easy for researchers aspiring 
to tenured positions to divert 
from generating high-impact 
publications, which are still the 
prime measure of performance 
(Nature Hum. Behav. 3, 1001; 
2019). In our view, however, they 
could significantly contribute 
to scientific endeavour by 
incorporating rigorous 
replication studies into their 
daily routine and embracing null 
results. 


Etienne B. Roesch University of 
Reading, Reading, UK. 
e.b.roesch@reading.ac.uk 


Nicolas Rougier Inria Bordeaux 
South West Research Centre, 
Talence, France. 
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Don’t underrate 
high-flyer SOFIA 


I find your assessment of 

the scientific worth of the 
Stratospheric Observatory for 
Infrared Astronomy (SOFIA) 
overly harsh (Nature 580, 314; 
2020). The facility provides 
invaluable data that would 
otherwise be unobtainable until 
well into the future. 

In addition to the unique 
features you mention, SOFIA is 
the only observatory open to 
the community that operates 
inthe infrared, from 28 to 
300 micrometres. Data from 
these wavelengths harbour 
clues about star formation 
from interstellar reservoirs of 
cold gas and dust, hinting at 
how our Solar System came 
into being. They also hold 
information about stellar death, 
the entourage of massive black 
holes in galactic nuclei and 
interstellar chemistry. 

Balloon platforms can 
operate in this spectral region, 
but do not offer SOFIA’s diverse 
observing opportunities. The 
observatory’s instruments are 
much in demand, particularly 
the far-infrared camera and 
polarimeter, and the GREAT 
heterodyne spectrometer. 
These generate 40 or so papers 
annually from this still-young 
facility. 

There are plans to launch 
satellite observatories that 
could replace SOFIA — including 
ajoint Japanese-European 
project called the Space Infrared 
Telescope for Cosmology and 
Astrophysics, or SPICA, and the 
US Origins Space Telescope. 
However, the cost of these 
would far exceed the cost of 
continuing to maintain SOFIA. 
Furthermore, neither of these 
observatories could be launched 
until well into the 2030s. 


Mark R. Morris University of 
California, Los Angeles, USA. 
morris@astro.ucla.edu 


Expertinsight into current research 


News & Views 


Neurodegeneration 


Lipid carrier breaks barrier 
in Alzheimer’s disease 


Makoto Ishii & Costantino ladecola 


People who carry the gene variant APOF4 are at 
higher-than-average risk of developing Alzheimer’s disease. It 
emerges that this variant is linked to defects in the blood-brain 
barrier and subsequent cognitive decline. See p.71 


The best-known hallmarks of Alzheimer’s 
disease are clumps of misfolded amyloid-B 
(AB) and tau proteins, which aggregate in the 
brain. However, there is increasing aware- 
ness that AB and tau might not be the whole 
story — alterations in the blood-brain barrier 
(BBB) have also emerged as early markers of 
this neurodegenerative disorder’. The degree 
of disruption to the BBB correlates with the 
degree of cognitive dysfunction that aperson 
experiences’, but what causes BBB break- 
down has been unknown. Montagne et al.’ 
present evidence on page 71 that the leading 
genetic risk factor for Alzheimer’s disease, 
apolipoprotein E4, is linked to BBB breakdown. 

The gene apolipoprotein E (APOE) encodesa 
major lipid-carrier protein, ApoE, inthe brain’. 
Thereare three predominant variants of APOE: 
APOE2, APOE3 and APOE4. As with almost all 
genes, people carry two copies of APOE, which 
can be either the same or different variants. 
Compared with the more-common APOEF3 
variant, APOF4 markedly increases the risk of 
Alzheimer’s disease — up to 4-fold in people 
with one copy of this variant, and 15-fold in 
people who have two copies*. People carrying 
APOE4 who do contract Alzheimer’s disease 
also tend to develop symptoms of the disorder 
earlier than those who develop the disease but 
do not carry the variant’*. 

Proteins from blood plasma have been 
found in the cerebrospinal fluid (the liquid 
that surrounds the brain and spinal cord) of 
cognitively healthy people who carry APOE4 
and who subsequently go on to develop 
Alzheimer’s disease. These proteins have 
presumably leaked through the BBB, indi- 
cating that the integrity of the barrier is lost 
before cognition declines®. Evidence from 
mouse models, and from the brains of people 
who have died with Alzheimer’s disease, 


suggests that BBB breakdown is caused by 
the degeneration of pericytes — cells nestled 
in the wall of cerebral capillaries. These cells 
normally safeguard the BBB’ by preventing the 
breakdown of junctions between endothelial 
cells, which make up the capillary walls. 
Whether ApoF4 is responsible for early BBB 
dysfunction in Alzheimer’s disease, by itself 
or in concert with Af and tau, was unknown. 
Montagne and colleagues set out to address 
this knowledge gap. The authors used a tech- 
nique called dynamic contrast-enhanced 
magnetic resonance imaging to investigate 
the permeability of the BBB in people who 
had either healthy cognition or mild cogni- 
tive impairment (a prelude to Alzheimer’s 


Pericyte 


Endothelial 
cell 


disease), grouped according to their APOE 
status. They found that people who were 
cognitively healthy and carried either one or 
two copies of APOF4 had a leaky BBB in two 
brain regionsimportant for memory and cog- 
nition — the hippocampus and the parahippo- 
campal gyrus. This leakage was worse in APOE4 
carriers who exhibited mild cognitive decline. 

Remarkably, these effects preceded any 
signs of tissue loss in the hippocampus and 
parahippocampal gyrus, attesting to the 
idea that BBB disruption is an early event in 
the onset of neurodegeneration. BBBleakage 
was independent of AB and tau accumula- 
tion, which the authors assessed both by 
studying samples of cerebrospinal fluid and 
through another brain-imaging technique, 
positron emission tomography. Montagne 
and co-workers found that, unlike in APOF4 
carriers, the BBB was intact in cognitively 
healthy APOES carriers. It was, however, 
leaky in APOE3 carriers who showed cognitive 
impairment — although less so than in APOE4 
carriers atan equivalent stage of impairment. 

Next, Montagne et al. examined whether 
BBB breakdown in APOF4 carriers was linked 
to pericyte degeneration. In support of this 
idea, they found that a biomarker of pericyte 
injury — a soluble form of a protein known as 
platelet-derived growth factor-receptor-B 
(sPDGFRB) — was elevated in the cerebrospinal 
fluid of APOF4 carriers compared with APOE3 
carriers. High levels of the protein in people 
who carried APOF4 were associated with a 


Cerebral 


capillary 


BBB disruption 


Cognitive decline 


Junction between 
cells disrupted 


Figure 1| The gene variant APOE4 and Alzheimer’s disease. People who carry APOF4 are at heightened 

risk of Alzheimer’s disease. Montagne et al. provide evidence that ApoEF4 protein is secreted by cells called 
pericytes, which abut endothelial cells that line cerebral capillaries at the blood-brain barrier (BBB). 
Secreted ApoF4 activates the protein cyclophilin A (CypA) in the pericytes. This triggers a downstream 
signalling pathway involving activation of the inflammatory protein matrix metalloproteinase-9 (MMP9) 

in pericytes, and possibly also in endothelial cells. This causes disruption of junctions between adjoining 
endothelial cells, opening the BBB in brain regions involved in learning and memory. Disruption of the BBB is 
associated with impaired cognition, although the mechanisms that link the two are unclear. 
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News & views 


leaky BBB and cognitive impairment. sPDGFRB 
elevation was independent of AB and tau. 

The authors then looked for insight into the 
mechanisms by which pericytes might become 
injured. They focused on cyclophilin A (CypA) 
and matrix metalloproteinase-9 (MMP9), two 
proteins that are part of an inflammatory 
pathway implicated in APOE4-driven pericyte 
damage and BBB breakdown‘. Levels of CypA 
and MMP%9 in the cerebrospinal fluid were 
higher in APOE4 carriers who had mild cog- 
nitive impairment than in cognitively healthy 
APOEF4 carriers or APOE3 carriers who had 
comparable cognitive dysfunction. Again, this 
change was not related to increases in AB or tau. 

Finally, the researchers generated pericytes 
in vitro from human induced pluripotent 
stem cells that expressed APOE3 or APOE4. 
They found that APOE4-expressing pericytes 
secreted substantially more CypA and MMP9 
than did APOEF3 pericytes. ApoE4 (but not 
ApoE3) secreted by pericytes activates the 
CypA-MMP9 pathway on nearby pericytes 
— the cells therefore cause their own demise. 
ApoF4 could also activate the CypA-MMP9 
pathway in endothelial cells, which are suscep- 
tible to the harmful effects of APOF4 (ref. 7). 
Therefore, injury to pericytes and endothelial 
cells might both cause BBB leakage (Fig. 1). 

These observations cast new light on APOF4 
that runs contrary to the widely held idea that 
this gene variant contributes to Alzheimer’s 
disease solely by promoting Af and tau accu- 
mulation‘. Instead, it seems that BBB dys- 
function might explain why APOF4 carriers 
are susceptible to Alzheimer’s disease. The 
authors’ findings might also explain why 
APOF4 carriers have worse outcomes follow- 
ing stroke or traumatic brain injury® than 
do people who carry other APOE variants. 
However, as Alzheimer’s disease progresses, 
APOE4 could also slow Af and tau clearance, 
exacerbating declines in cognition. 

Even more striking is the finding that early 
drivers of cognitive impairment differ between 
APOF4 and APOE3 carriers. Montagne and col- 
leagues’ findings indicate that activation of the 
CypA pathway and pericyte damage might not 
beinvolved in cognitive impairmentin people 
who carry the most common APOE variant, 
APOE3. But whether a leaky BBB caused by 
factors that are independent of pericytes (for 
example, damage to endothelial cells caused 
by AB; ref. 1) contributes to cognitive impair- 
ment in APOE3 carriers remains unclear. The 
role of the BBB in APOEF2 carriers, which was 
not assessed in the current study, also remains 
unknown. Although APOE2is associated with 
a reduced risk of Alzheimer’s disease com- 
pared with other APOE variants, this is unlikely 
to result from a more resilient BBB, because 
APOE2 carriers have an increased risk of micro- 
haemorrhages, suggesting vascular frailty’. 

Whether and how BBB breakdown leads 
to cognitive impairment also remains to be 
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determined. Is it a cause or a consequence of 
the disease process? Evidence from mice indi- 
cates that some proteins in the blood, suchas 
fibrinogen, damage the synaptic connections 
between neurons’. But a pathogenic role for 
these proteins in the human brain has not yet 
been demonstrated. 

Irrespective of these questions, Montagne 
et al. have broadened our understanding of 
how APOF4 promotes cognitive impairment. 
They have also demonstrated that different 
APOE statuses can promote disease through 
different mechanisms. A deeper appreciation 
of how gene variants shape Alzheimer’s dis- 
ease might prove crucial for more-personal- 
ized approaches to treating this prevalent and 
incurable disease. 


Atomic physics 
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Exotic helium atom lit up 


Niels Madsen 


An elusive type of atom knownas pionic helium has been 
directly excited by laser light for the first time. The work 
establishes a promising experimental platform for probing 


fundamental physics. See p.37 


Exotic atoms are those in which one or more 
of the constituents of normal atoms have 
been replaced by an exotic particle, such as 
an antimatter particle. These atoms can then 
be probed to search for any tiny discrepancies 
in their properties from those predicted by 
models using techniques that underpin the 
world’s most accurate timekeepers, atomic 
clocks — and thereby opening a window onthe 
foundations of physics. On page 37, Horiet al.’ 
are the first to report laser excitation of helium 
atoms in which one electron has been replaced 
by asubatomic particle called a pion. 

The interest in exotic atoms arises from 
the fact that they often facilitate the most 
basic experimental strategy used in physics: 
changing a single parameter or component 
in an otherwise complex system, to observe 
the effect. In practice, this is not as simple as 
it might seem. Different particles can have 
different masses or charges, and might inter- 
act with their surroundings differently in other 
subtle ways. However, such subtleties often 
add to the value of exotic atoms. 

As the techniques needed to study exotic 
atoms improve, increasing numbers of 
scientists are working with these atoms to 
investigate the fundamental properties of 
nature. A good example of this is the ‘proton 
radius puzzle’, which arose from a study of 
muonic hydrogen’ — ahydrogen atom in which 
the electron has been replaced by asubatomic 
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muon particle (muons have similar properties 
to electrons, but have about 200 times 
greater mass). 

Muonic hydrogen was used to determine 
a key property of the proton known as the 
charge radius, but the value obtained was 
about seven standard deviations away from 
the expected value at the time. The value 
obtained using muonic hydrogen has since 
been independently confirmed ina study of 
ordinary hydrogen’, and also in experiments 
in which electrons are scattered from pro- 
tons*, potentially clarifying the true value of 
the proton radius and thus solving the puzzle. 
Nevertheless, muonic hydrogen aptly illus- 
trates how exotic, sometimes short-lived, 
atomic systems can be used to poke holes in 
seemingly well-established results. 

Animportant feature of exotic atoms that 
adds to their utility as probes for fundamental 
physics is that they are bound systems (energy 
is needed to pull their components apart), with 
multiple internal energy states. Transitions 
between these states are therefore amenable 
to study by laser spectroscopy, the most pre- 
cise measurement tool in the physics toolkit. 
The study of transitions in atoms — and par- 
ticularly inthe hydrogen atom — is an ongoing 
effort that has spanned more than two centu- 
ries. It inspired Niels Bohr’s groundbreaking 
model of the atom in the early twentieth 
century, for example, and has driven much 
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Figure 1| Howto make and excite pionic helium. Hori et al.' fired a beam of negatively charged subatomic particles, called pions, at liquid helium. When a pion 
struck a helium atom, it could knock out one of the electrons and replace it in a high-energy orbit around the helium nucleus (which consists of two protons and two 
neutrons), thus forming an exotic atom known as pionic helium. The authors fired laser light at the exotic atoms and thus observed the transition of the piontoa 
lower-energy orbit — a process that triggers the ejection of the remaining electron. This ejection speeds up the absorption of the pion by the nucleus (not shown), 


which finally breaks apart. 


of the development of quantum mechanics. 

Today, atomictransitions are the foundation 
on which all time measurements are built: a 
transition in the caesium-133 atom provides 
a reference value that underpins the Inter- 
national System of Units definition of the 
second. Therefore, spectroscopic techniques 
for measuring transitions are constantly being 
refined, and the best measurements can now 
reach staggering precisions — to almost 
18 significant figures’. The most precise meas- 
urements of exotic atoms still lag some way 
behind, but ameasurement on antihydrogen 
(the bound state of an antiproton and an anti- 
electron) achieved a milestone precision of 
almost 12 significant figures®, thus paving 
the way for extremely sensitive tests of the 
fundamental properties of antimatter. 

In their seminal work, Hori et al. record the 
first observation of a transition in a ‘pionic’ 
helium atom. In such an atom, one of the two 
electrons of a helium atom has been replaced 
by asubatomic particle called a pion. Pions 
were discovered’ by Cecil Powell and co-work- 
ers in 1947, but their existence was first pre- 
dicted® in 1935 by Hideki Yukawa. They belong 
to the family of subatomic particles known as 
mesons, which are made up of a quark and an 
antiquark; quarks are the particles that make 
up protons and neutrons. 

Pions are short-lived particles that comein 
positively charged, negatively charged and 
neutral varieties. The negatively charged 
pions used by Hori et al. have a lifetime of 
only 26 nanoseconds when isolated. It is thus 
no small feat that the experimenters not only 
succeeded in replacing an electron in helium 
atoms with a pion, but also observed the 
resulting exotic atom undergo a quantum tran- 
sition. A further difficulty is that the lifetime 
of the pion in the exotic atom can be reduced 
to picoseconds because of its vicinity to the 
atom’s nucleus. 

The authors prepared pionic helium atoms 
by firing a beam of pions at a liquid-helium 
target. In Hori and colleagues’ experiment, 


the helium target was cooled to a cryogenic 
temperature of about 2 kelvin. This allowed 
some pions to be captured in a weakly bound 
state of pionic helium, in which the pion was 
sufficiently far from the nucleus to be shielded 
from it by the remaining electron (Fig. 1). The 
resulting exotic atom therefore retained a life- 
time of nanoseconds, which is long enough for 
alaser pulse to excite the nascent exotic atom. 

Using a short pulse (0.8 nanoseconds) of 
infrared laser light, Hori and colleagues pro- 
voked atransition of the pion. This resulted in 
the ejection of the remaining electron, leaving 
ashort-lived system consisting of just a pion 
bound toa helium nucleus. The pion was then 


“Theauthors prepared 
pionic helium atoms by firing 
abeam of pion particles ata 
liquid-helium target.’ 


absorbed by the nucleus, leading to the latter’s 
break-up (fission). 

The pion transition was detected by 
carefully removing large ‘background’ signals 
from the experimental data; this background 
was associated with fission products from 
short-lived states of pionic helium, or was 
generated by the pion beam itself. That left 
a signal from just three transitioning pionic 
helium atoms per hour, or an estimated three 
per billion exotic atoms produced. Despite the 
challengingly low numbers, the laser-induced 
signal for the transition was clearly detected, 
and the laser frequency at which it occurred 
(and which corresponds to the energy change 
of the transition) could be determined with an 
absolute precision of about five significant 
figures. 

This result builds on Hori and colleagues’ 
extensive experience of studying another 
exotic helium-type atom, in which one elec- 
tron was replaced by an antiproton’. That 
work resulted in, among other things, the most 
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precise determination so far of the ratio of the 
mass of the antiproton to that of the electron. 
However, the authors had to overcome further 
challenges to study pionic helium. 

For example, the lifetimes of pionic helium 
atoms are shorter than those of antiprotonic 
helium and the widths of lines in its spectrum 
are broadened, because the excited exotic 
atoms undergo many collisions in the rela- 
tively dense liquid-helium target. It might be 
possible to reduce the density of the liquid 
helium to resolve these somewhat incalcula- 
ble effects, although this would also lower the 
signal rate. It remains to be seen whether extra- 
polation to zero density, where collisional 
effects are minimized, is practicable. 

Nevertheless, Hori and colleagues’ work 
opens up a whole new experimental system 
for further exploration. If the above challenges 
can be overcome, such exploration might ena- 
ble the accuracy of the mass of the negative 
pion to be improved by a factor of 10-100, for 
instance; currently, this mass is known to only 
six decimal places. The experiment thus paves 
the way to fresh insights into the fundamental 
constituents of nature. 


Niels Madsen is in the Department of Physics, 
Swansea University, Swansea SA2 8PP, UK. 
e-mail: n.madsen@swansea.ac.uk 
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Pancreatic cancer 


Hidden in plain sight 


Anirban Maitra 


Pancreatic cancer does not respond to certain anticancer 
treatments that boost immune responses. A mechanism active 
in tumour cells that contributes to this evasion of immune 
targeting has been uncovered. See p.100 


A consistent hallmark of pancreatic cancer is 
the inability to treat it with immunotherapy — 
an approach that harnesses the body’s immune 
response to target a cancer. On page 100, 
Yamamoto et al.’ reveal a mechanism that 
enables pancreatic cancer cells to evade an 
immune response. The process implicated 
is usually associated with the normal deg- 
radation and recycling of cellular proteins. 
The authors find that inhibition of this path- 
way, using drugs or by genetic approaches, 
reverses this immune evasion in animal models 
of pancreatic cancer. The finding provides a 
compelling rationale for investigating whether 
targeting this pathway might be of benefit in 
the clinic. 

The past decade has seen striking advances 
in the use of immunotherapy to treat numer- 
ous solid cancers (those not formed from 
blood cells) that had not responded to earlier 
therapy attempts. One such advance is the 
development of what is known as checkpoint 
blockade therapy. This targets the proteins 
PD-1 and CTLA-4, which are found on the 
surface of immune cells called cytotoxic 
T cells (also called CD8 T cells) and inhibit 
an immune response. The administration of 
two antibodies that, respectively, target these 
two inhibitory proteins is used widely to drive 
an antitumour immune response’. Unfortu- 
nately, treatment of pancreatic cancer with 
either antibody, or the two together, has not 
led to any notable success in terms of patient 
survivaP’. 

There are many factors that cancontribute 
to the failure of immunotherapy. The tumour 
microenvironment of pancreatic cancer 
contains a variety of immune cells that sup- 
press the function of cytotoxic T cells — they 
include myeloid cells, tumour-associated 
macrophages and regulatory B cells*. More- 
over, signals released by the cancer cells 
themselves (often mediated by the aberrant 
activation of tumour-promoting genes such 
as those encoding the proteins Ras and Myc) 
have a key role in creating this profoundly 
immunosuppressive environment’. In 
addition, fibroblast cells inthe tumour micro- 
environment secrete material that generatesa 
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physical barrier hindering the influx of T cells 
to the tumour site’. 

Another equally crucial cause of immune 
evasion is that the cancer cells themselves 
undergo changes. An immune response is 
triggered when aT cell recognizes as foreign 
a peptide fragment (called an antigen) that 
is ‘presented’ on the surface of acancer cell, 
bound to a molecule called a major histo- 
compatibility complex (MHC) classI molecule 
(Fig. 1). Aninability of the MHC molecules to 
present tumour antigens has emerged’ as 
an explanation for how cancer cells can hide 
from the immune system. These molecules 
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comprise an invariant B,-microglobulin pro- 
tein (encoded by the B2M gene) and a protein 
encoded by HLA genes, which vary in the anti- 
gen-binding sitethey encode, thereby enabling 
different antigens to be bound by different 
MHC molecules. Tumour-derived-antigen 
presentation by MHC class I molecules is 
required for cancer cells to be recognized by 
the T-cell receptor on cytotoxic T cells. 

The inability to present antigens, and thus 
the absence of immune recognition, can occur 
through alterations in the genes encoding 
proteins needed for antigen presentation, 
such as B2M and HLA genes*”. In pancreatic 
cancer, such genetic alterations are relatively 
uncommon, occurring in no more than 1% of 
cases’’. However, lower-than-normal levels of 
MHC class! molecules, ortheircompleteloss, 
occurs in more than 60% of cancers that arise 
in the pancreas”, and the decrease might be 
even greater in metastatic tumours (those that 
have spread beyond the pancreas)’. Until now, 
the mechanisms underlying the regulation of 
MHC class I molecules in pancreatic cancer 
have remained elusive. 

Yamamoto and colleagues reveal that a 
cellular pathway called autophagy is the 
means by which pancreatic cancer cells 
limit the amount of MHC class I molecules 
on their surface, thereby hindering antigen 
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Figure 1| Boosting the targeting of pancreatic cancer by the immune system. The ability of immune cells 
called cytotoxic T cells to attack tumours is thwarted by, for example, the action of inhibitory proteins called 
CTLA-4 and PD-1. Yamamoto et al.' report a previously unsuspected mechanism that enables pancreatic 
cancer to evade immune cells. a, The T-cell receptor (TCR) onthe surface of cytotoxic T cells enables them 
to recognize tumour cells, and this recognition depends on the presence of major histocompatibility 
complex (MHC) class I molecules on the surface of cancer cells. Yamamoto and colleagues report that 

MHC class! molecules in pancreatic cancer cells are destroyed by a process termed autophagy. This 

begins when the molecule binds to the protein NBR1 and is enveloped in a membrane to form an organelle 
called an autolysosome. The destruction of MHC class I molecules thus prevents them from ‘presenting’ 
peptide fragments (antigens) from the tumour that might be recognized by the TCR. b, In mouse models 

of pancreatic cancer, treatment combining an autophagy-inhibiting drug (chloroquine) and antibodies 
targeting PD-1 and CTLA-4 (termed checkpoint blockade therapy) provoked a robust immune response 
against the tumour compared, with the case for animals that did not receive such treatment. 
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presentation. Autophagy is an essential 
cellular degradation pathway that recycles 
organelles and proteins to maintain cellular 
‘fitness’. It can act selectively through the 
binding of a specific receptor to a ‘cargo’ (for 
example, a protein or organelle) that has been 
targeted for destruction by being marked with 
atag, suchas the protein ubiquitin. Acomplex 
of receptor and cargo is enveloped in a lipid 
membrane to forma vesicle called an autopha- 
gosome, which fuses with an organelle known 
as alysosome to form an autolysosome (Fig. 1). 
The cargo then undergoes enzyme-mediated 
digestion inthe autolysosome andits contents 
are recycled for use inthe cell. 

Yamamoto et al. report that, remarkably, 
in pancreatic cancer cells, most MHC class I 
molecules do not exist onthe cell surface, but 
instead are found in auttophagosomes and 
autolysosomes. The authors identified 
an autophagy-associated receptor called 
NBRI1 as being responsible for targeting 
MHC class I molecules to the autophagy 
machinery. Furthermore, they found that if 
autophagy was inhibited in mice, by drugs 
suchas chloroquine or through genetic engi- 
neering, this restored the surface expression 
of MHC class I molecules, thereby enhanc- 
ing antigen presentation. In mouse models 
of pancreatic cancer, autophagy inhibition 
resulted in an influx of cytotoxic T cells to the 
tumour microenvironment, andifthe animals 
also received checkpoint blockade therapy, 
a robust antitumour immune response was 
generated. 

Increased autophagy has been known for 
around a decade™ to be a metabolic require- 
ment for pancreatic cancer, but only now 
has a connection been made to the immune 
evasion of tumour cells. Thus far, clinical 
trials targeting autophagy in pancreatic 
cancer have relied on testing the antimalarial 
drug hydroxychloroquine (chloroquine and 
hydroxychloroquine are related molecules), 
which blocks one of the final steps in auto- 
phagy. Other drug candidates, directed 
at earlier components of the autophagy 
machinery, are in the pipeline. Early trials 
of hydroxychloroquine demonstrated only 
modest results, but there has been a resur- 
gence of interest in combinatorial treatment 
approaches after evidence from animal mod- 
els that, if signalling mediated downstream 
of mutant Ras by the enzyme MAP kinase is 
inhibited, pancreatic cancer cells become 
strongly dependent on autophagy for their 
survival’*, 

Yamamoto and colleagues’ work will almost 
certainly lead to further additions to the com- 
pendium of autophagy-targeted clinical trials 
of pancreatic cancer treatments. Discoveries 
in the fields of autophagy and immunother- 
apy were, respectively, recognized by the 
Nobel Prize in Physiology or Medicine in 
2016 and 2018. This new finding represents 


an unprecedented opportunity for the con- 
vergence of these two areas of study, in efforts 
toimprove therapies for pancreatic cancer. 
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Mass spectrometry for 
future atomic clocks 


Marianna S. Safronova 


Highly charged ions could form the basis of the next 
generation of ultra-precise clocks, using electronic transitions 
in the ions as the ‘pendulum’ An ingenious method for 
characterizing such transitions has been reported. See p.42 


Atomic clocks, which use transitions between 
the energy levels of electrons in atoms as a 
reference for their timekeeping mechanism, 
are the world’s most accurate clocks — they 
will not lose one second during the lifetime 
of the Universe’. This means that they can be 
used in ultra-precise measurements to probe 
some of the fundamental postulates of mod- 
ern physics. Clocks based on highly charged 
ions (HCIs; atoms from which many electrons 
have been removed) are predicted to have 
even more sensitivity in these investigations’. 
However, the development of such clocks 


“The authors used Einstein’s 
principle of energy-mass 
equivalence toconverta 
mass measurement into an 
energy measurement.” 


is hampered by the difficulty of detecting 
suitable transitions in HCls. 

On page 42, Schiissler et al.? report that they 
have measured a long-lived, excited electronic 
state ina highly charged rhenium ion using the 
mass difference of the ion in its ground and 
excited states. This non-destructive, direct 
determination of an electronic excitation in 
an HCI will aid the discovery of HCI transitions 
that would be suitable for use ina clock. 

To build a clock, one needs a periodic 
event whose frequency acts as areference for 
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timekeeping. Electronic transitions in atoms 
are perfect natural oscillators for this purpose. 
Anultra-stable laser must be tuned to the exact 
frequency of the atomic transition to drive the 
oscillation, much asa musical instrument must 
be tuned to produce the right tone. 

Can just any atomic transition be used? No— 
suitable transitions are hard to come by. The 
best transitions start from the lowest energy 
state of an atom (the ground state) and must 
end up ina long-lived (metastable) excited 
state. The energy needed to stimulate the 
transition must also be within the range of 
tabletop-laser technologies. 

Moreover, the atoms must be held in traps, 
so that their motion is almost completely 
frozen — in other words, the operation of 
atomic clocks requires precision manipu- 
lation of quantum systems. For this reason, 
currently available clocks use transitions 
either in electrically neutral atoms or in ions 
produced by removing one electron from an 
atom, because these systems are the most 
amenable to precision quantum control. 

Substantial advances have been made 
in studies of HCIs, and all the technologies 
required to make a clock using such anion 
were demonstrated only this year*. However, 
progress is hindered by the difficulty in using 
conventional atomic spectroscopy to iden- 
tify and measure transitions suitable for use in 
clocks — the characteristics of such transitions 
mean that they are, by definition, very weak 
(the probability of the transition occurring 


Nature | Vol 581 | 7 May 2020 | 35 


News & views 


First position 


Frequency measurements 


" . \ 
\ Hjz=~ (~~ \ ~~ \ | 
mi | 
Trap 1 Trap 2 Trap 3 Trap 4 Trap 5 
Second position 
© “| Y 
y 


Figure 1 | Mass measurements of a highly charged ion. Schiissler er al.? used an instrument called 
PENTATRAP, which consists of five stacked ion traps, to determine how the mass of the rhenium ion Re””* 
differs in its ground state (blue ions) and ina metastable excited state (red ions). The authors captured three 
ions in their first position (traps 2-4) at 4 kelvin, and simultaneously measured the cyclotron frequency — 
the frequency of anion’s motion in a magnetic field — of the ions in traps 2 and 3. All three ions were then 
shifted to their second position, so that the ions in traps 2 and 3 had different states from the ones that 

were trapped in the first position. The cyclotron frequencies of the ions in those traps were measured, 

and the whole sequence was repeated many times. From the ratio of cyclotron frequencies of ions in the 

two different states, the authors determined the associated difference in ion mass, and thus the change of 


energy that occurs when a Re?* 


conventional atomic spectroscopy. 


is small). Schiissler et al. therefore used a 
completely different and ingenious method 
to measure the energy change that occurs 
during a weak transition in a highly charged 
rhenium ion (Re?*): they used Einstein’s 
famous principle of energy—mass equivalence 
(E= mc?) to convert amass measurement into 
an energy measurement. 

The basic idea is to trap a single ion ina 
Penning trap, a device that confines charged 
particles using magnetic and electric fields. 
The mass of an ion in a Penning trap can be 
determined by measuring the frequency of the 
ion’s motionina magnetic field (the cyclotron 
frequency). The binding energy of an atom or 
ion — the energy required to break the atom 
into its free electrons and a nucleus — is dif- 
ferentin an excited metastable state from that 
in the ground state. The mass therefore also 
changes, which, in turn, alters the cyclotron 
frequency. 

In their experiments, Schissler et al. 
measured the ratio (R) of the cyclotron 
frequency of Re” in the ground state and the 
metastable state. Because the difference in 
energy of the two states in Re””* is extremely 
small compared with the total energy of the 
ion, the precision of the measurement needs 
to be extraordinarily high. The authors meas- 
ured Rtoa precision of 10™, using a device 
known as PENTATRAP. 

PENTATRAP consists of a stack of five 
Penning traps cooled to atemperature of 4 kel- 
vin (Fig. 1). Traps 2 and 3 are used to measure 
cyclotron frequencies, whereas traps 1 and 4 
are used to store ions. Trap 5 was not used in 
the current experiments, but will allow moni- 
toring of fluctuations inthe magnetic field and 
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ion transitions between states. Such a measurement would be difficult using 


in other experimental variables in the future. 

The authors loaded three ions into the 
innermost traps, so that theions in traps 2 and 
4wereinthesamestate (either the metastable 
state or the ground state), and theionintrap 3 
was in the alternative state. First, they meas- 
ured the cyclotron frequencies of the ions in 
traps 2 and 3 simultaneously. They then moved 
the three ions up by one trap, effectively 
swapping the states of the ions in traps 2 and 
3 (the states of the ions did not change, only 
their positions; Fig. 1), and simultaneously 
measured the cyclotron frequencies of those 
ions. The three ions were moved back down 
by one trap, and the sequence began again. 
Overall, the electronic states in traps 2 and 3 
were repeatedly swapped, and simultaneous 
measurements were taken after each swap. 

This experimental procedure, combined 
with the design of the PENTATRAP device, 
suppresses the effect of magnetic-field vari- 
ations onR, thus allowing R to be determined 
with high accuracy. The energy difference 
between the ground and excited states can 
then be calculated using R and the ion mass 
ina variant of Einstein’s equation; the actual 
mass of the ion needs to be known only toa 
precision of 10+. 

This first demonstration of the method 
opens up exciting possibilities for measuring 
the transition energies in HCls that are difficult 
to measure using conventional approaches. 
Moreover, the energy change measured by 
Schiissler and colleagues is in excellent agree- 
ment with that predicted from the authors’ 
advanced theoretical calculations. This agree- 
ment demonstrates that theory can be used to 
predict the transition energies in HCls, thereby 


© 2020 Springer Nature Limited. All rights reserved. 


facilitating the discovery of more transitions. 

The transition energy measured in the 
current work corresponds to a frequency 
that lies outside the range of lasers that can 
be used ina clock. However, the authors note 
that it should be possible to use their method 
to measure transitions that have lower fre- 
quencies suitable for clock development in 
the near future. 

Clocks based on HCI transitions are 
particularly attractive because they could 
be used in stringent tests that are sensitive 
enough to detect physics beyond the stand- 
ard model of particles and interactions — such 
as variations of fundamental physical con- 
stants and violations of Lorentz invariance” 
(a cornerstone of physics that acts as the 
mathematical foundation for Einstein’s spe- 
cial theory of relativity). Such clocks would 
also be particularly sensitive to the effects of 
ultralight dark matter?®, one of the candidates 
for the ‘missing’ matter in the Universe. Tre- 
mendous progress in the control of HCIs has 
been made in the past few years”**, paving the 
way towards these applications. The precision 
mass spectrometry enabled by PENTATRAP 
will also have other valuable applications’, 
such as in tests of the energy—mass equiva- 
lence principle, experimental determinations 
of the upper limits of the mass of neutrino 
particles, and tests of quantum electrodynam- 
ics, the theory that describes the interactions 
between particles and light. 
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® Check for updates 


Charged pions’ are the lightest and longest-lived mesons. Mesonic atoms are formed 
when an orbital electronin an atom is replaced by a negatively charged meson. Laser 
spectroscopy of these atoms should permit the mass and other properties of the 
meson to be determined with high precision and could place upper limits on exotic 
forces involving mesons (as has been done in other experiments on antiprotons? ’). 
Determining the mass of the 7 meson in particular could help to place direct 
experimental constraints on the mass of the muon antineutrino”” °. However, laser 
excitations of mesonic atoms have not been previously achieved because of the small 
number of atoms that can be synthesized and their typically short (less than one 
picosecond) lifetimes against absorption of the mesons into the nuclei’. Metastable 
pionic helium (‘He’) is a hypothetical’ * three-body atom composed of a helium-4 
nucleus, an electron anda occupying a Rydberg state of large principal (n=16) and 
orbital angular momentum (/~ n- 1) quantum numbers. The ‘He’ atom is predicted 
to have an anomalously long nanosecond-scale lifetime, which could allow laser 
spectroscopy to be carried out”. Its atomic structure is unique owing to the absence 


of hyperfine interactions’*”’ between the spin-O m7 and the *He nucleus. Here we 
synthesize m*He’* ina superfluid-helium target and excite the transition 

(n, 2) = (17,16) > (17, 15) of the 1 -occupied m*He’ orbital at a near-infrared resonance 
frequency of 183,760 gigahertz. The laser initiates electromagnetic cascade processes 
that end with the nucleus absorbing the 7 and undergoing fission?°”", The detection 
of emerging neutron, proton and deuteron fragments signals the laser-induced 
resonance in the atom, thereby confirming the presence of m*He’*. This work enables 
the use of the experimental techniques of quantum optics to study a meson. 


Ameson consists of a valence quark and an antiquark that are bound 
by the strong interaction. Following the 1932 discovery of the neutron, 
the existence of mesons was predicted” in 1935 as the quanta associ- 
ated with the hypothetical field that gives rise to the strong force 
binding protons and neutrons into atomic nuclei. From the apparent 
femtometer-scale range of the nuclear force, which was assumed to 
be mediated by the exchange” of mesons, the new particle was pre- 
dicted to be 200 times more massive than the electron. In 1947, 
charged pions having roughly the expected mass became the first 
mesons to be discovered“ among the secondary particles produced 
by cosmic rays. In the same year, Fermi and Teller suggested” that 
various exotic atoms and ions may be formed when one or more elec- 
trons in anormal atom are replaced by a heavy, negatively charged 
particle such asa . Whereas isolated 7 mesons decay via the weak 
interaction witha lifetime” of r= 26 ns into a negatively charged muon 
(w) and a muon antineutrino (¥,), m™ mesons that occupy excited 
atomic states typically undergo rapid (<1 ps) electromagnetic cascade 
processes” and deexcite to s, p or d atomic states. As these low-/ 
orbitals have a large overlap with the atomic nucleus, the 7 is 
absorbed by the nucleus via the strong interaction. Metastable m*He* 
atoms have nevertheless been predicted with a lifetime more than 
1,000 times longer than those of other pionic atoms. The long 


lifetime arises because the large binding energy of helium (/, =25 eV) 
and the high multipolarity of the transition (A/> 3) inhibit the deex- 
citation of the m*He* by Auger emission of the 1s electron *. The 
remaining electron protects the stability of the 7 orbital and prevents 
Stark mixing with the low-/ orbitals during the many collisions with 
normal helium atoms that the m*He* makes during its lifetime. This 
extreme longevity (for a mesonic atom) makes He’ a favourable 
candidate for laser excitation”. 

For context, laser spectroscopy has previously been achieved for 
three-body antiprotonic helium (pHe' = p + He2* + e) inwhich ananti- 
proton p occupies ann ~38 orbital witha microsecond-scale lifetime>’. 
This allowed the determination® of the antiproton-to-electron mass 
ratio with a fractional precision of 8 x 10°. On the other hand, the 
smaller mass of m1 (seven times smaller than that of p) means that 7*He* 
is much less stable and amenable to direct detection than pHe’, which 
resembles a diatomic molecule”*. Consequently, no atomic lines of 
mHe* have been observed so far. The existence of this atom has been 
indirectly inferred from experiments” * in which the lifetime against 
nuclear absorption of 7 mesons brought to rest in helium targets was 
estimated to be r= 300 ps-7 ns. These values cannot be easily inter- 
preted because they differ from the expected Auger lifetimes of the 
He’ states’, 
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Fig. 1| Energy level diagram of mHe’. On the left-hand scale the theoretical 
absolute energy of each m‘He’ state with principal and angular momentum 
quantum numbers (n, /) is plotted relative to the three-body breakup threshold. 
The wavy lines indicate Auger-dominated states with picosecond-scale 
lifetimes, and the solid lines show metastable levels with theoretical lifetimes 
greater than 10ns. The indicated lifetimes were calculated” as the reciprocal of 
the sum of the Auger and radiative decay rates of each state and the intrinsic 
3.8 x10’ s weak decay rate of 1. Dashed lines indicate the two-body m‘He** 
ionic states composed of a helium nucleus and a7, and curved arrows show 
some Auger transitions with multipolarity Al=2.We searched for the laser 
transitions indicated by the straight arrows. 


The spectroscopic signal that would provide unambiguous evidence 
for the existence of m*He* can be observed only if a substantial popula- 
tion of 7 mesons were to occupy the states interrogated by the laser”. 
Nothing is experimentally known about the quantum numbers of these 
populated states, but we assumed” © that n would be distributed 
around ng = ./M*/m, ~16, where M denotes the reduced mass of the 
pion-nucleus system and m, is the electron mass. This corresponds to 
ahighly excited 7 orbital with the same radius and binding energy as 
those of the replaced electron. Figure 1 shows the energy level diagram 
of mHe’, obtained from three-body quantum electrodynamics calcu- 
lations” that include relativistic and radiative recoil corrections of 
order m,a*, where a denotes the fine-structure constant. The 
complex-coordinate-rotation method allows the energies of states 
with even large Auger widths (/, = 30 GHz) to be calculated*” with a 
relative precision of <10°°. The precision of the theoretical transition 
frequencies v,, are limited by the 10 *-scale experimental uncertainty?” 
of them mass. The effects of collisions”? with helium atoms, which 
may destroy some m*He’ states, are difficult to estimate and were not 
included in the calculated lifetimes shown in Fig. 1. 

We used sub-nanosecond laser pulses to induce a transition 
froma ‘He’ state with a nanosecond-scale lifetime to a state with a 
picosecond-scale lifetime against Auger decay”. The two-body pionic 
helium ion (7*He”* = m+ *He**) that remained after Auger emission 
was destroyed in atomic collisions that induced Stark mixing between 
Rydberg and low-/ orbitals. The resonance condition between the m*He* 
and the laser was detected as a peak in the total production rate of neu- 
trons, protons and deuterons that emerged from the resulting nuclear 
absorption. We resolved this signal, which was superimposed ona back- 
ground of m*He* atoms that spontaneously decayed with a lifetime” 
of ~7 ns, by measuring the nuclear fission products using scintillation 
counters with nanosecond-scale timing resolutions. Larger disper- 
sions inthe arrival times of the 7 and laser pulses at the experimental 
target, or fission products at the detectors, would cause the signal to 
broaden and become indistinguishable from the background, which 
was up to 10‘ times larger. 

The experiment was carried out at the mES beamline (see Fig. 2a) of 
the 590-MeV ring cyclotron facility*** at the Paul Scherrer Institute 
(PSI), which provided am beam with momenta between p= 83 MeVc? 
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and 87 MeVc ‘with the highest average intensity N= 2-3 x10’ s ‘avail- 
able worldwide. The large contamination (>3 x 10° s“) of electrons in 
the beam was removed using a550-kVE x B Wien filter. The purified 1 
beam arrived at the ‘He target at timings spaced by At=19.75 ns, which 
arose from the f, = 50.63 MHz accelerating radiofrequency of the cyclo- 
tron. Each radiofrequency cycle contained on average N,,/f, = 0.4-0.6 
™ mesons. The w mesons were distinguished (see Fig. 3a) from the 
residual e- and yr by their different time of flight from the graphite 
target*° where the particles were created, as well as by the characteristic 
energy deposition AF of the particles traversing asegmented scintilla- 
tion counter positioned upstream of the ‘He target. 

Around 2.3% of the 7 mesons that came to rest in the superfluid-*He 
target (length 150 mm, diameter 42 mm, temperature 7 = 1.7 K; 
Fig. 2b) were assumed” to form He’. This implies a production rate 
of >3 x 10°s*, which ensures a 10*-scale probability of coincidence” 
between the laser pulse and the m*He* atom. A laser beam of diameter 
d=25mm, pulse energy £, =10 mJ, pulse length At= 800 ps, repetition 
rate f,=80.1Hzand wavelengths A =1,515-1,634 nm entered the target 
chamber through a fused-silica window. The absence of bubbles in 
the superfluid *He allowed the laser light to reach the m*He* without 
considerable scattering. 

The 1-3 neutrons and the proton or deuteron that typically emerged 
fromam absorption followed anticollinear?°”' trajectories and had 
kinetic energies distributed between a few megaelectronvolts and 
E=60-80 MeV. The arrival times t, and energy depositions AF of these 
fission products were measured (Fig. 3b) by an array of 140 plastic 
scintillation counters covering a solid angle of 2m steradians as seen 
from the target. This high segmentation and the high-speed waveform 
digitization” of the signals allowed us to resolve individual mHe* events 
despite the high intensity of the 7 beam. Each 40 x35 x 34 mm? counter 
inthe array provided a<10% detection efficiency to signal neutrons’”** 
of kinetic energy £2 25 MeV. The background electrons that arose from 
the beam or the decay of yw in the experimental apparatus deposited 
an average energy of AF = 6-8 MeV ina counter. Most of these were 
removed by rejecting events with AF < 20-25 MeV. 

The histogram in Fig. 3c shows the distribution of the arrival times 
of events at the 140 scintillators measured in this way without laser 
irradiation. The zr arrivals at t= 0 and t=19.75 ns produced peaks rep- 
resenting the more than 97% of 7 mesons that underwent immediate 
nuclear absorption. The remaining (2.1+ 0.7)% fraction gave a continu- 
ous spectrum that decayed with a lifetime of t= 7 +2 ns in the intervals 
between 7 arrivals. This roughly agrees with the results of a Monte 
Carlo simulation” for the signal expected from mHe’ and witha previ- 
ous experiment” that used a liquid helium target. 

The laser pulses irradiated the target at t=9 ns after the arrival of the 
1 mesons with atypical timing jitter of At<1ns. For this, we developed 
an injection-seeded, optical parametric generation (OPG) and ampli- 
fication (OPA) laser system” and a pump laser based on an amplified 
microchip laser (Fig. 2c), which was precisely synchronized with the 
radiofrequency of the cyclotron. 

We initially assumed that the states n= 16 were the most populated and 
searched for the transition (n, J = (16, 15) > (17, 14). A continuous-wave 
pulse-amplified dye and Ti:sapphire hybrid laser was used to irradi- 
ate the *He target with 700-ps-long laser pulses of energy E, = 7-8 mJ 
and diameter d=20 mm. The laser was scanned over a200-GHz region 
around the calculated” transition frequency of v,,=781052.6(2.0),, GHz. 
The theoretical (‘th’) uncertainty of 2.0 GHz mostly arises from the 
uncertainty of the 7 mass” used inthe calculation. No significant signal 
was observed. Subsequent calculations revealed that the coupling of 
the daughter state (17, 14) to an electronically excited m*He’ state gives 
rise to large scalar and tensor polarizabilities of 4 x 10* and 70 atomic 
units, respectively”. This can destabilize state (17, 14) against atomic 
collisions and prevent the detection of the resonance. 

We next searched for (16, 15) > (16, 14) at A=1,515.3 nm using the 
OPG-OPA laser. The 250-fs lifetime” of the (16, 14) state implies a large 
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Fig. 2| Experimental layout. a, Schematic layout of the 7 beamline**>. The” 
mesons are produced by allowing a proton beam of power P=1.2 MW and 
repetition rate f,=50.63 MHz to traversea graphite target*®. 7 mesons with 
momentum p= 83-87 MeV c‘and intensity N, = 2-3 x 10’ s‘are transported bya 
magnetic beamline to the position of asuperfluid-helium target. An£ x B Wien 
filter removes thee contamination from the beam. b, Layout of the 
experimental target. The beam traverses asegmented scintillation counter 
before coming to rest in the helium target, thereby producing m*He* atoms. The 
atoms are irradiated with laser pulses with length At=800 ps, energy £,=10 mJ, 
diameter d=25 mm and wavelengthA =1,515-1,634 nm. The proton, neutron 


width of [, = 640 GHz for the resonance. Data accumulated over 300h 
corresponding to more than 6 x 10’ detected m arrivals showed no 
substantial signal. The reason for this is not understood, but atomic 
collisions may destroy the 7 population occupying state (16, 15), as 
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Fig. 3|Detection ofm arrivals and nuclear absorptions inthe helium 
target, and time spectra of the emerging fission fragments measured with 
and without laser irradiation. a, Contour plot showing the correlation 
between the arrival times t, and energy depositions AF of particles measured 
by ascintillation counter positioned at the entrance of the experimental target. 
The particles arrived at timings spaced by At=19.75 ns. The particle type (7, uw 
ore’) was identified, and them events in the dashed rectangle were selected. 
a.u., arbitrary units. b, t,-AF contour plot of emerging fission fragments that 
strike the scintillator array following the 7 absorption by helium nuclei. 
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and deuteron fragments”°” that emerge from the mw absorptioninthe helium 
nuclei are detected by anarray of 140 plastic scintillation counters surrounding 
the target. c, Schematic layout of the laser system (not to scale), whichis based 
onacontinuous-wave (cw) external-cavity diode laser (ECDL). OPG® and OPA 
are carried out inmagnesium-doped periodically poled lithium niobate 
(MgO:PPLN) and potassium titanyl phosphate (KTP) crystals to produce the 
laser pulses that irradiate the m*He* atoms. The laser beams used to pump the 
crystals are produced by amplifying the output of a microchip laser of 
A=1,064nm in neodymium-doped yttrium aluminium garnet (Nd:YAG) fibre 
and rod crystals (see text). 


observed“ for several pHe' states. Alternatively, it is possible that a 
very small 7 population is captured into (16, 15). 

We shifted our search to the transition (17, 16) > (17, 15) inthe n=17 
region. This resonance with an amplitude” of M = 1.4 atomic units 
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Background electrons of AE < 20-25 MeV and fission products of low velocity 
were removed by accepting events with energy depositions between 
AE=20-25 MeVat the lower limit and AF 2 60 MeVat the higher limit inthe 
dashed rectangle. c, Time spectra of fission fragments measured with (filled 
circles with error bars) and without (histogram) laser pulses of wavelength 
A~1,631.4 nm irradiating the target at t= 9 ns. The peak at t=9 ns inthe 
former spectrum corresponds to the resonance signal of the transition 

(n, ) = (17, 16) > (17, 15) of metastable He’. Error bars, 1s.d. 


Nature | Vol581 | 7 May 2020 | 39 


Article 


1,633.04 nm 


oO 


Oo N 


Normalized signal intensity (x10~4) 


1,629.93 nm 


E it 
: 8 10. 12 


Time elapsed (ns) 


oO 


6 14 


Fig. 4| Laser resonance signals of the transition (n, [) = (17, 16) > (17,15) in 
metastable m*He’ atoms. a-d, Normalized time spectra of the resonance 
signal measured at four laser wavelengths. The plots are obtained by taking the 
difference between the timing distributions of m absorption in*He nuclei 
measured with and without laser irradiation. When the laser is tuned toa 
wavelength ofA ~ 1,631.4 nm, the transition (17, 16) > (17, 15) is excited anda peak 
is generated at t=9.5ns at the rate of 7 absorptions. Error bars, 1s.d.e, Profile 
of the resonance measured by scanning the optical frequency of the laser overa 


should readily saturate at our laser fluence of ¢=2 mJ cm”. The time 
spectrum indicated by the filled circles in Fig. 3c represents 2.5 x 10’ 
1 arrivals detected over 100 h, with the laser wavelength tuned to 
A=1,631.4 nm. A sharp peak was observed at t= 9 ns containing 300 
events with a signal-to-noise ratio of 4 and a statistical significance of 
>7 s.d., in coincidence with the laser pulses. The width At=2 ns of the 
peak is commensurate with the dispersions in the times of flight of the 
fission fragments arriving at the scintillator array. The measured detec- 
tion rate of 3h” of resonant m*He’ is compatible with the production 
rate of >3 x 10°s of the atoms and with Monte Carlo simulations” that 
assume that most of the metastable population lies in the (17, 16) state. As 
expected, detuning the laser frequency away from the m*He’* resonance 
(Fig. 4a-d) caused the signal to gradually decrease and finally disappear. 

By scanning the laser frequency over a 500-GHz-wide region and 
plotting the relative number of counts under the laser-induced peak, 
we obtained the resonance profile of Fig. 4e. The statistical uncertainty 
of the signal is indicated by the error bars. The -100-GHz width of the 
resonance roughly agrees with a convolution of the calculated” Auger 
width /, = 33 GHz of state (17, 15), collisional® and power broadening 
(~-50 GHz) and the linewidth (-10 GHz) of the seeded narrowband com- 
ponent of the laser beam. Additional width may be due to atomic col- 
lisions that shorten?” the lifetime of (17, 15). 

The3.0-GHz spacing” between the fine-structure sublines that arise 
from the interaction between the electron spin and the orbital angular 
momentum ofr is small compared to the 33-GHz natural width of the 
resonance. The best fit (blue curve) of two overlapping Lorentzian func- 
tions that take the sublines into account havea reduced y’ value of 1.0. 
Lorentzian lineshapes are expected” for resonances that are homoge- 
neously broadened by the natural width, binary atomic collisions and 
power-broadening effects. The resonance centroid was determined 
AS Vexp = 183760(6) sar(6) syst GHZ. The statistical (‘stat’) uncertainty of 
6 GHz is due to the finite number of detected m*He’* atoms. The sys- 
tematic (‘syst’) uncertainty of 6 GHz contains contributions related 
to the selection of the fit function (5 GHz), the calibration of the laser 
frequency against the absorption lines of °C’°O molecules*! (0.8 GHz) 
andthe uncertainty” related to the OPG and OPA processes (3 GHz). 

The fact that v,,, is larger than the calculated value” of 
Vin = 183681.8(0.5),, GHz by Av= 78 GHzis believed to be due to atomic 


40 | Nature | Vol581 | 7 May 2020 


Normalized signal intensity (10-5) 


3.0 


2.5 


a 
——e 


500 600 700 800 900 
Laser frequency — 183,000 (GHz) 


FTITT 


1,000 


500-GHz-wide region and plotting the normalized number of counts under the 
peaks. Black filled circles indicate experimental data points with 1-s.d. error 
bars, the blue line is the best fit of a double Lorentzian function, and the red 
arrow indicates the position of the spin-averaged transition frequency 
obtained by a three-body quantum electrodynamics calculation”. The 
systematic uncertainty” of the laser frequency arising from the OPG and OPA 
processes is +3 GHz. 


collisions that perturb the energy levels and shift the resonance 
frequency”, as has been observed” for pHe’. The effect was calcu- 
lated using the impact approximation of the binary collision theory 
of spectral lineshapes*. The gradient of the shift at a target tem- 
perature of T=4 K was predicted to be dv/dp =4.4-6.5 x 107! GHz cm?, 
depending on the interatomic potential and model used. At the 
superfluid target density, p = 2.18 x 10” cm, this corresponds toa 
blueshift of Av = 96-142 GHz, which roughly agrees with the above 
experimental result despite the complexity of the multi-body 
calculations. 

We have measured the resonance frequency of Vx) = 
183760(6) <:a:(6)syst GHz of the transition (17, 16) > (17, 15) for 1t*He* pro- 
duced in a superfluid-helium target, thus demonstrating laser spec- 
troscopy of an atom containing a meson. The shift of v,,, caused by 
collisions must be measured before we can determine the 7 mass. We 
plan to search for resonances suchas (17, 16) > (16,15) that are predicted 
to be narrower by a factor of at least 10°. By measuring v,,, at various 
densities of a helium gas target, the zero-density value may be deter- 
mined*°. Although it is difficult to predict how precisely the 7 mass 
may ultimately be determined, the natural width of some resonances 
implies a fractional precision of <10 °°, if the m*He* atom could be syn- 
thesized in targets of sufficiently low density as in the pHe’ case®”’. 
Along with H,* and HD* molecular ions**“* and pHe'atoms, metastable 
1He* is now among the fundamental three-body systems containing 
two massive particles that can be precisely studied by laser 
spectroscopy. 
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Methods 


Pion beam 

A proton beam* of energy £ = 590 MeV and power P = 1.2 MW was 
allowed to traverse a rotating graphite target” of thickness tj=40 mm, 
thereby producing secondary 7, and e (see Fig. 2a). Particles with 
momenta between p = 83 and 87 MeV c ‘were collected by magnetic 
elements*® and transported over a 14.2-m-long beamline to the posi- 
tion of the superfluid-*He target. The momentum was chosen to allow 
the largest possible number of 7 mesons to rapidly come to restinthe 
‘He target before they could decay while reducing the background 
arising from faster and slower particles that stopped in the walls of the 
apparatus. In our initial experiments, the high intensity (>3 x 10° s4) 
of contaminant electrons in the beam saturated the detectors and 
prevented the observation of m*He’*. An£ x B Wien filter was therefore 
placed in the beam upstream of the target to divert most of the elec- 
trons towards the steel blades of a slit collimator with a thickness of 
80mm. The filter consisted of a dipole magnet anda pair of 1.67-m-long 
electrode plates separated by a150-mm gap and with a voltage differ- 
ence of AV=550kV. This was the highest voltage that could be applied 
stably without frequent breakdowns. A pair of quadrupole magnets 
then focused the purified 7 beam of intensity N, = 2-3 x10’ stintoan 
elliptical spot of full-width at half-maximum horizontal and vertical 
beam diameters of 23 mm and 15mm, respectively. The beam diameters 
were determined by measuring the number of 7 hits ona scintillator 
of size 5 x 5 x 3 mmas it was slowly moved through the beam using a 
motorized translation stage. 

The beam traversed a plastic scintillator plate with a thickness 
of 4.7 mm before entering the target (see Fig. 2b). The plate was seg- 
mented into four sections of size 20 x 20 mm’, each of which was read 
out by metal-packaged photomultiplier tubes with 8-mm-diameter 
bialkali photocathodes. The last three dynode stages of each pho- 
tomultiplier were driven with field-effect transistors arranged in an 
emitter-follower configuration. This prevented the high intensity of 
the particle beam from strongly saturating the anode currents. The 
timing resolution over a 10-min interval was estimated as At< 0.8 ns. 
Radiation damage caused the light yields of the scintillators to slowly 
deteriorate, and they were replaced every 10-14 days. 

The contour plot in Fig. 3a shows the correlations between the arrival 
times t, and energy depositions AF of particles on the scintillators. The 
1 events located in the dashed rectangle were distinguished from 
the residual wz and e” events by the characteristic time of flight and 
estimated energy deposition AF = 2.6 MeV for 17. Thee’ background 
was thus suppressed by <10* compared to measurements carried out 
without the E x B Wien filter. Monte Carlo simulations” showed that 
remaining w ande misidentified in the analysis caused a small reduc- 
tion inthe signal-to-noise ratio of the laser spectroscopic signal that was 
negligible compared to the >13% statistical uncertainty of the signal. 


Experimental target 

The target (Fig.2b) consisted of a150-mm-long, 42-mm-diameter cylindri- 
cal aluminium chamber that was filled with superfluid *He of temperature 
T=1.7 K. The thin (¢;=500 pm) lateral walls of the chamber allowed the 
passage of the nuclear fission products with small energy losses, which 
could then be detected with a high efficiency. The chamber was cooled 
by a1-m-long Joule-Thompson refrigerator through which superfluid 
4He was continuously pumped by a roots blower and a rotary vane pump 
with respective pumping speeds of 500 m?h ‘and 200n’h". Them beam 
entered the chamber through a1.6-mm-thick aluminium window andtra- 
versed a 6-mm-thick, 36-mm-diameter mirror made of fused silica before 
reaching the superfluid *He. A dielectric multilayer coating of reflectivity 
>98% was deposited on the downstream surface of the mirror by electron 
beam evaporation. Monte Carlo simulations” carried out using the GEANT4 
package indicated that >80% of them mesons cametorest inthe superfluid 
4He, whereas the rest stopped in the mirror or the chamber walls. 


The 25-mm-diameter laser beam of pulse energy £, = 10 mJ and 
wavelengths between A = 1,515 nm and 1,634 nm entered througha 
35-mm-diameter fused-silica window with antireflection coating that 
was fixed on the opposite end of the chamber. The beam irradiated 
>60% of the m*He* atoms produced in the target, retro-reflected off 
the mirror, and exited through the window. This helped to reduce the 
heating of the target by the laser beam of average power P= 800 mW. 


Detectors and electronics 

Each of the 140 plastic scintillation counters of size 40 x 35 x 34 mm? 
surrounding the target was read out by photomultipliers of the type 
described previously with an intrinsic timing resolution of At<1ns. 
Variations in the times of flight of the fission products emerging from 
them absorption degraded this resolution to At= 2 ns. The segmen- 
tation reduced the pile-up probability of multiple particles striking a 
single counter in rapid sequence. The size of each counter was care- 
fully chosen to provide a significant (<10%) detection efficiency for 
signal neutrons® of kinetic energy F = 25 MeV while also allowing the 
rejection of most of the background electrons that arose either from 
the particle beam or from decays in the experimental apparatus. An 
electron that traversed the full thickness of a counter corresponding 
to 1/10 radiation length deposited an average energy of AF=6-8 MeV, 
witha distribution tail’ extending to higher energies. The signals were 
amplified by d.c. coupled amplifiers with bandwidth /, = 400 MHz and 
gaing=10. The analogue waveform of the signal from each counter was 
simultaneously recorded over a At= 83.7 ns interval for every laser pulse 
arrival, using 140 waveform digitizers with sampling rates f,=3.06 Gss7. 
For this, we constructed a readout system using application-specific 
integrated circuits (ASICs) based on switched capacitor arrays” 
(DRS4), sample-and-hold amplifiers and pipelined switched-capacitor 
analogue-to-digital converters. 40 field-programmable gate arrays 
continuously collected and recorded the waveform data*“ at a total 
acquisition rate of 13-15 GBh ona 24-h basis. 

The waveform data were analysed using an algorithm based on 
constant-fraction discrimination to determine the arrival times ¢, and 
energy depositions AF in the scintillator array. A typical t,-AE con- 
tour plot obtained in this way is shown in Fig. 3b. Because the detector 
could not identify the type of particle that traversed it, we selected 
the events located in the dashed rectangle, with energy depositions 
between AF = 20-25 MeV and AEF 260 MeV. This removed most of the 
above-mentioned background electrons with energy deposition 
AE < AE, = 20-25 MeV and low-velocity fission products that could 
degrade the timing resolution for detecting 7 absorptions. This rejec- 
tion threshold AE, was selected” based on Monte Carlo simulations 
carried out during the design of the counters. The time spectra in Fig. 3c 
show the distributions of the accepted events measured with and with- 
out laser irradiation at t= 9 ns (see main text). The At=4 ns widths of 
the peaks at t= 0 and 19.75 ns are primarily caused by variations in the 
times of flight of 7 mesons with momenta between p=83 MeV cand 
87 MeV c ' that arrive at the *He target. 


Laser system 

The OPG-OPA laser system was based on a continuous-wave 
external-cavity diode laser with an output power of P=50 mW, linewidth 
[<1MHzand wavelengths between/ =1,515 and 1,634 nm (see Fig. 2c). 
The optical frequency was continuously measured bya Michelson inter- 
ference wavelength meter and calibrated against the absorption lines* 
of °C°O molecules with a precision of <0.8 GHz. This seed beam was 
amplified by OPG” using a pair of 40-mm-long MgO:PPLN crystals set 
in series, which produced 800-ps-long laser pulses of energy F, = 70 pJ. 
OPA to E,,=10 mJ was carried out by passing these laser pulses through 
five KTP crystals of lengths /=20 mm. The broadband light arising from 
amplified spontaneous emission (ASE) in the MgO:PPLN crystals was 
discarded after each amplification stage using dielectric beam split- 
ters and irises. The ASE in the laser beam was monitored witha grating 


monochromator of spectral resolution AA = 300 pm. The linewidth?’ 
of the seeded narrowband component of the laser beam excluding 
the ASE was ~10 GHz. During the OPG and OPA processes, the optical 
frequency v,, of the amplified laser pulse could shift” relative to the 
seed laser v,,,. From relative measurements of v,,and v,,, using a Fizeau 
interferometer anda grating spectrometer”, and from rough numerical 
calculations of the OPA process”, a conservative uncertainty of 3 GHz 
was ascribed to the time-averaged optical frequency of the laser pulses 
that reached the m*He’. 

We constructed a diode-pumped solid state (DPSS) Nd:YAG laser 
to pump the nonlinear crystals. The laser was based on an actively 
Q-switched microchip laser oscillator and amplifier, which produced 
800-ps-long laser pulses of £, = 80 J andA = 1,064 nm. The beam was 
further amplified to £, = 1.5 mJ ina Nd:YAG fibre amplifier”, and then to 
50 mJ by passing the beam twice througha series of three transversely 
pumped DPSS Nd:YAG amplifiers with crystal rod lengths of [= 73, 
73 and 126 mm, diameters d=3, 3 and 5 mm, and peak pump powers 
P=2.4,2.4and 6 kW, respectively. The pulsed amplifiers were mounted 
ona3 x1.5 m’ optical table and water-cooled by three recirculating 
chillers having a total refrigeration power of 4 kW. The laser was trig- 
gered at arepetition rate of f, = 80.1Hz and atiming jitter of At<1nsin 
synchronization with the f, = 50.63 MHz accelerating radiofrequency 
of the cyclotron. The trigger signal f, was generated by dividing the 
radiofrequency f, by a factor of 632,400 using a two-phase frequency 
divider. This ensured that the laser pulses and 7 would arrive at the 
experimental target with the intended relative timings. The arrival time 
of the laser pulses, which was monitored by an infrared-sensitive film 
and a photomultiplier, was found to slowly drift by 1 ns over periods 
of a few hours, presumably owing to thermal effects in the microchip 
laser. This was constantly corrected. 

We developed a continuous-wave pulse-amplified dye laser and 
Ti:sapphire amplifier to search for the transition (16, 15) > (17, 14) at 
a wavelength of A = 383.8 nm. A continuous-wave distributed Bragg 
reflector semiconductor laser produced a seed beam of power 
P=40 mW, linewidth <2 MHz and A= 767 nm. The beam was ampli- 
fied to P=450 mW using a semiconductor tapered amplifier. The laser 
wavelength was measured using a Fizeau wavelength meter, which 
was calibrated to a precision of <1 GHz using a frequency-stabilized 
HeNe laser. The calibration was verified by carrying out absorption 
spectroscopy of the D, line of a natural isotopic mixture of K. The seed 
beam was pulse-amplified in two dye cells, which were oriented at 
Brewster’s angle and filled with Styryl-8 dye dissolved in ethanol. This 
produced 800-ps-long laser pulses of energy F, = 2 mJ. The beam was 
further amplified to £,, = 20-25 mJ by allowing the laser pulses to make 
five passes through a 20-mm-long Ti:sapphire crystal*®. Frequency 
doubling ina beta barium borate crystal generated 700-ps-long laser 
pulses of £, = 8 mJ and A= 383.8 nm. The dye cells were pumped by a 
frequency-doubled, amplified microchip laser system similar to the 
one used inthe OPG-OPA laser described above. The Ti:sapphire crystal 
was pumped by a separate frequency-doubled Nd:YAG laser, which 
generated 30-ns-long laser pulses of E, = 80-90 mJ. This DPSS laser 
was based on atriangular ring cavity’. 


Resonance of m*He* 

Time spectra of m*He* were measured at 17 settings of the laser 
frequency around the (n, [) = (17, 16) > (17, 15) resonance. The rela- 
tive intensity of the m*He* resonance signals (see Fig. 4a—d) was 
determined by taking the difference between normalized time 
spectra measured with and without the laser irradiation and count- 
ing the number of events under the laser-induced peak within a 
At =3.6-ns-wide region centred around t= 9.5 ns. This interval was 
chosen on the basis of a Monte Carlo simulation” that estimated 
the width At of the resonance signal. This width arose for several 
reasons: first, the fission products were emitted isotropically with 
kinetic energies between a few megaelectronvolts and F=80 MeV”, 


whereas the flight path of the particles to the array of 140 scintil- 
lation counters varied between 40 mm and 150 mm depending on 
the emission angle. The resulting time-of-flight dispersion caused a 
broadening of At~2 ns depending on the energy rejection threshold 
AE, of the scintillators. Second, Monte Carlo simulations indicate 
that the mHe* atoms were distributed over an 80-mm-long volume 
of the experimental target”. This corresponds to a 0.5-ns transit 
time for alaser beam that irradiated the volume, retro-reflected off 
the mirror and exited the chamber. Third, the timing profile of the 
1,631-nm laser pulse contained several peaks” that fluctuated from 
shot to shot, causing a 1-ns-scale jitter in the timing when the mHe* 
atoms underwent laser excitation. 

The vertical error bars in the resonance profile of Fig. 4e indicate 
the statistical uncertainty of the signals caused by the finite numbers 
of resonant m*He’ events. Each point represents data collected over 
20-30 h. Systematic uncertainties due to changes in the choice of cuts 
(optimized by Monte Carlo simulations; see above) were investigated 
by slightly varying the selection criteria for wv arrival events (Fig. 3a) 
and the detection of fission products by the scintillator array (Fig. 3b). 
The effect of this on the statistical significance of the m*He* resonance 
signal was small and agreed with the simulation results”, whereas the 
experimental value of the transition frequency v,,,, remained well within 
the systematic uncertainties of the measurement (see main text). 


Data availability 


As the datasets generated during and/or analysed during the current 
study are in total 20 TBytes in size consisting primarily of waveform 
data, they are available from the corresponding author on reasonable 
request. 
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® Check for updates 


State-of-the-art optical clocks! achieve precisions of 10’ or better using ensembles of 


atoms in optical lattices”? or individual ions in radio-frequency traps*>. Promising 
candidates for use in atomic clocks are highly charged ions® (HCIs) and nuclear 
transitions’, which are largely insensitive to external perturbations and reach 
wavelengths beyond the optical range® that are accessible to frequency combs’. 
However, insufficiently accurate atomic structure calculations hinder the 
identification of suitable transitions in HCIs. Here we report the observation of a 
long-lived metastable electronic state in an HCI by measuring the mass difference 
between the ground and excited states in rhenium, providing a non-destructive, 
direct determination of an electronic excitation energy. The result is in agreement 
with advanced calculations. We use the high-precision Penning trap mass 
spectrometer PENTATRAP to measure the cyclotron frequency ratio of the ground 
state to the metastable state of the ion with a precision of 10""—an improvement by a 
factor of ten compared with previous measurements. With a lifetime of about 

130 days, the potential soft-X-ray frequency reference at 4.96 x 10" hertz (corresponding 
toa transition energy of 202 electronvolts) has a linewidth of only 5 x 10’ hertzand one 
of the highest electronic quality factors (10) measured experimentally so far. The low 


uncertainty of our method will enable searches for further soft-X-ray clock 


transitions®” j 


in HCls, which are required for precision studies of fundamental physics®. 


Modern clocks and frequency standards range from ensembles of 
neutral particles trapped in optical lattice clocks*” to individual, sin- 
gly charged ions confined in Paul traps“. With a fractional frequency 
accuracy of 108, such clocks enable stringent tests of fundamental sym- 
metries (for example, Lorentz invariance’), geodetic measurements” 
and searches for new physics” and dark matter"®. The transitions used 
as frequency references typically have long lifetimes, of the order of 
seconds, yielding sub-hertz linewidths; a recent example is the Yb* 
clock’*’. Optical clocks use variations of the Ramsey method, driving 
such forbidden transitions through stimulated absorption and emis- 
sion much faster than the natural transition rate, but maintaining the 
very long coherence times needed for the interrogation schemes owing 
to their small spontaneous emission rates’. One promising species for 
use in the next generation of clocks is HCIs° because their compact size 
in comparison with atoms makes them less sensitive to external field 
fluctuations. Several transitions have been proposed to feature a high 
sensitivity to a variation of fundamental constants?°”). 

Although electronic binding energies in HCls typically amount to 
several kiloelectronvolts” and inter-shell transitions usually appear 
in the X-ray region, there are also intra-shell fine and hyperfine transi- 
tions in the optical and ultraviolet range”*?”. Furthermore, some HCIs 


feature level crossings®”’ with associated optical transitions. Most of 
these transitions are electric dipole (E1)-forbidden, and many are of the 
magnetic dipole (M1) type, with lifetimes of the order of milliseconds, 
but some are highly forbidden and can have extremely long lifetimes 
of up to millions of years”, as predicted theoretically. A few of these 
transitions with lifetimes ranging from milliseconds to seconds have 
been found and investigated using storage rings” and ion traps?°”!, 
but so far no method has enabled a direct observation of the excitation 
energy of extremely long-lived metastable states. 

Itis difficult to theoretically predict such electronic energies in HCls. 
Calculations suffer from difficulties related to strong relativistic con- 
tributions and complex correlations of several active electrons in open 
shells. Moreover, experimental identification of such transitions is 
challenging because their long lifetimes leads to sub-hertz linewidths 
and therefore requires very precise application of the Ritz-Rydberg 
method” to other, accurately known transitions to establish the search- 
ing range to be explored with narrow-linewidth spectroscopy lasers. 
Additionally, rough calculations with the flexible atomic code® can be 
used to estimate the energy of transitions. 

Here we demonstrate how high-precision Penning trap mass 
spectrometry directly identifies a suitable clock transition in HCIs by 
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Fig. 1|Results of the Re measurements at PENTATRAP. All error bars indicate 
louncertainty; for details see text. Left, three *’Re?” ions (ion1, blue; ion 2, red; 
ion3, blue) areloaded from the top into the trap stack. For each measurement 
trap, the potential @ of the neighbouring traps is nominally identical; that is, 
the potential of trap 2 is also applied in trap 4. The magnetic field points along 
the direction of the trap axis. Centre, by shifting the ions one trap down or up 
after acyclotron frequency determination, we obtain measurements withion1 


determining the mass difference of a’®*’Re”™ ion in the ground state 
and a metastable state to a precision below 2 eV or ~500 THz. A relative 
mass measurement of the two HCI states with an unprecedented pre- 
cision of 1x 10™ is achieved by determining the cyclotron frequency 
v.=qB/(2m™m) of the HCI with mass mand charge gina strong magnetic 
field (B = 7 T) and a weak electrostatic harmonic potential of the Pen- 
ning trap. The cyclotron frequency is derived by measuring the ion 
trap frequencies (modified cyclotron frequency, v, ~ 16 MHz; axial 
frequency, v, = 700 kHz; magnetron frequency, v_ ~ 10 kHz) and 
applying the invariance theorem* v2= v2 + v2+ v2. 


= 


Penning trap mass spectrometer PENTATRAP 


This work was carried out using the high-precision Penning trap mass 
spectrometer PENTATRAP at the Max-Planck-Institut fiir Kernphysik 
in Heidelberg. HCIs produced in the commercial electron beam ion 
trap DreEBIT* using the MIVOC technique” are extracted in 1-p1s-long 
bunches with a kinetic energy of a few kiloelectronvolts per charge, 
selected according to their charge-to-mass ratio, g/m, by means of a90° 
dipole magnet, and sent into the Penning traps. Prior to the trapping 
of the ions, their kinetic energy is reduced to a few electronvolts per 
charge by appropriately timed pulsed potentials applied to two cylin- 
drical drift tubes. The 4-K cold bore of the superconducting magnet 
houses five cylindrical Penning traps””**. Two of them (traps 2 and 3; see 
Fig. 1) are used to measure the trap frequencies of the ions of interest. 
Traps land 4 serve to store ions, whereas trap 5 will allow monitoring 


(R - 1.000000001161) x 10° 


(blue) andion 2 (red) in trap 2 and with ion 2 (red) and ion 3 (second blue ion) in 
trap 3. Here, the ion 2isin the ground state (red) and the other two (blue) arein 
the metastable electronic state. Right, all ratios (n=54) determined over seven 
measurement campaigns (divided by the dashed lines) display the stability of 
the system. The results for trap 2 (filled symbols) and trap 3 (empty symbols) 
showasimilar behaviour. The final weighted mean is shown inred. 


fluctuations of the trap potentials and the magnetic field in the future. 
To reduce temporal variations of the magnetic field in the traps, the 
temperature inthe laboratory is kept constant at a level of 0.1K per day. 
Furthermore, the level of liquid helium and the helium pressure inside 
the magnet cold bore are stabilized at a level of a fraction of a millimetre 
and 20 pbar, respectively. These measures reduce fractional changes of 
the magnetic field inthe measurement traps to values of less than afew 
parts in 10” per hour, resulting ina cyclotron frequency drift of onlya 
few millihertz per hour at v, = 16 MHz. Because the magnet is actively 
shielded witha shielding factor of 50, the fractional fluctuation of the 
magnetic field in the trap due to the fluctuation of the stray magnetic 
field does not exceed 3 parts in10". 

The measurement procedure of the cyclotron frequency ratio of two 
different ion states consists of the preparatory and the main phase. 
During the preparatory phase, three ions in the two states are first 
loaded into the three innermost traps (ion lin trap 2 and ion3intrap 4 
being inthe same state—for example, the ground state—whereas ion 2 
intrap 3 being inthe other state). The desired ion states are confirmed 
by arough determination of their respective cyclotron frequencies; if 
they are not inthe desired state, ions are reloaded. After that, their trap 
motion amplitudes are reduced (the ions are ‘cooled’) by coupling the 
magnetron and cyclotron motions to the axial motion, which in turn 
is brought into thermal equilibrium with a tank circuit having a tem- 
perature of around 6 K. The resonance frequency of the tank circuit is 
approximately equal to that of the axial motion. The axial, magnetron 
and cyclotron amplitudes of cooled *’Re?”* ions are approximately 
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10 um, 2m and 2 pm, respectively. Great care is taken to prepare only 
a single ion of interest in each trap, because the presence of another 
undesired ion inthe trap would disturb the motion of the ion of interest 
and hence alter its motional frequencies. The main phase is devoted 
to the measurement of the ion trap frequencies in traps 2 and 3. The 
reduced cyclotronand axial frequencies are simultaneously measured 
by means of the pulse-and-phase” and dip*° techniques, respectively. 
This minimizes systematic shifts arising from changes in the external 
fields compared with sequential measurements of the trap frequen- 
cies. The measurement cycle consists of a set of sub-measurements. 
First, the cyclotron motion is excited to an amplitude of approximately 
10 pm witha subsequent measurement of its phase (reference phase). It 
is followed by the re-cooling of all three motions. After that, the cyclo- 
tron motion is again excited to the same amplitude and is allowed to 
freely evolve its phase for 40 s during the subsequent measurement 
(measurement phase). During the phase evolution time of 40s, the axial 
frequency is measured with the dip method. The measurement cycleis 
ended by re-cooling all ion motions. Because the relative mass differ- 
ence between the ground and metastable states is at the level of 10°, 
itis sufficient to measure the smallest trap frequency—the magnetron 
frequency—oncea day witha moderate precision for the determination 
of the cyclotron frequency ratio. Ten measurement cycles constitute 
a measurement run. A unique feature of PENTATRAP is that during 
eachrun, frequency measurements are performed simultaneously in 
synchronized traps—for instance, onion 1in trap 2 (upper blue ionin 
Fig. 1) and ion 2 (red) in trap 3—for approximately 12 min. After that, 
the ion species are swapped between traps 2 and 3 by moving the three 
ions adiabatically one trap up. Then, ameasurement run is carried out 
with ion 2 in trap 2 and ion 3 (lower blue ion) in trap 3. Lastly, the ion 
species are swapped back by moving the three ions one trap down 
again. After each change of the configuration of ions in the traps, the 
ion motions are cooled for 20s. Inthis way, one alternately determines 
the cyclotron frequencies of ions in the ground and metastable states 
in each measurement trap. This measurement sequence is repeated 
until the measurementis stopped, given that—even though the vacuum 
allows storage times of several hours to days—ions are then lost owing 
to charge-exchange reactions with background gas. If determination 
of the metastable state is no longer possible, the traps are emptied and 
anewset of ions is loaded. 


Energy of the metastable state 


To obtain the cyclotron frequency ratio of the ions in the ground and 
metastable states, the polynomial method described, for instance, in 
refs. °"“1is applied in the following way. We treat the two measurement 
traps independently. The temporal variation of the magnetic field—and 
hence the cyclotron frequency—is approximated with a low-order poly- 
nomial, as shown inthe middle plots of Fig. 1. This assumption is based 
on our experience with the magnetic field evolution over a scale of a 
few hours. The full set of measured cyclotron frequencies is divided 
into groups. The group length must be as short as possible, such that 
it can be described by a low-order polynomial, but must contain sub- 
stantially more data points than the polynomial degrees of freedom. 
Thus, each group is chosen to be approximately 2 hlong and to contain 
ten cyclotron frequency points (five points correspond to the ion in 
the ground state and five to the ion in the metastable state); see middle 
plots of Fig. 1. We use third-grade polynomials because this is the low- 
est order that has at least one inflection point. Data points from ions 
in both the metastable and the ground state are fitted with the same 
polynomial using global fitting parameters. The polynomial for the 
data points from theion inthe ground state is scaled with an additional 
fitting parameter R, which corresponds to the frequency ratio. This 
yields three quantities: (1) the frequency ratio R, (2) its uncertainty 
ARand (3) the reduced x’. If x*>1, we scale the uncertainty of the ratio 
by x”. The final ratio R is obtained as a weighted mean of the frequency 
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ratios from all groups and measurementtraps. Its statistical uncertainty 
ARis the larger of the internal and external errors”; see right side of 
Fig. 1. A variation of the polynomial order (second, third and fourth) 
results in an increase of the statistical uncertainty of the final ratio 
by -10%. 

The data were blindly analysed independently by two people, each 
using separately developed software. Both analyses yield similar 
external errors of 8 x 10°” and internal errors of 7 x 10, resulting in 
a Birge ratio of 1.14. The two weighted means differ by 4 x 10". Thus, 
the combined final ratio is 


R-1=1.161(10)¢¢ar X 10°? 


(All reported uncertainties are 1o.) The uniqueness of the considered 
187Re** ions in the two electron configuration states is that the fractional 
difference in their masses is only 10°. Uncertainties due to systematic 
effects such as image charge shift, relativistic shift and higher-order 
magnetic and electric field (see, for example, ref. *?) variations can 
be neglected. For these uncertainties to be at the level of 10% of the 
statistical uncertainty (10~”), the cyclotron frequencies would have to 
be shifted by at least 10 kHz, whichis excluded considering the present 
known performance of PENTATRAP. Therefore, the systematic uncer- 
tainties in the final ratio are well below the statistical uncertainty and 
hence can be neglected. 

The ground and metastable states in'®’Re”™ belong to the Kr 4d'°'S, 
and 4d? 47H; electron configurations, respectively. Because the 
metastable state can only decay to the ground state—by a highly forbid- 
den electric triacontadipole (E5) transition—its estimated lifetime is 
~130 d. (We note that whereas optical transitions with similarly high 
multipolarities may feature much longer lifetimes’, the high transi- 
tion energy, together with hyperfine quenching, yields a lifetime in 
this regime.) Owing to repeated and fast processes of electron impact 
excitation followed by radiative decay inside the electron beam ion 
trap, low-lying metastable states and the ground state are roughly 
equally populated. In this case, the metastable ion fraction detected in 
PENTATRAP is approximately 50%. For the determination of the energy 
ofthe metastable state, seven measurement campaigns with three ions 
each were performed over the course of 12 days, during which not a 
single decay was detected. 

From the final ratio, the energy of the metastable state with respect 
to the ground state can be calculated using AFp. = m(!8’Re?"")(R - 1)c? 
(c, speed of light in vacuum). The uncertainty of the ion mass enters 
the total uncertainty reduced by a factor of (R- 1), and must be known 
to only Am/m =10~* to become negligible. Because the mass of the 29 
removed electrons and their binding energies are well below that limit, 
we can simply use the neutral mass of *’Re for the calculation, which 
gives AF, =202.2(17) eV. 


Calculations of the energy difference 


The high charge state and the complexity of the open 4dand 4fsubshells 
calls for state-of-the-art theoretical methods to match the experimental 
accuracy. We apply three different fully relativistic approaches, namely, 
the multiconfiguration Dirac-Hartree-Fock (MCDHF) method**** 
in two different implementations and the configuration interaction 
method employing Kohn-Sham orbitals, as implemented by the 
many-body script language Quanty“. The calculations also help to 
identify the observed metastable state. The first such state could 
be the 4a°4f P, level with an energy of ~195 eV (see Fig. 2). This level 
may decay only to the ground state by a highly forbidden two-photon 
transition. However, Re has a nuclear spin of /= 5/2 and thus a mag- 
netic moment that couples this *P, level to the nearby ’P, level, which 
can decay to the ground state by an E1 transition. The ?P, state thus 
acquires part of the probability of this allowed transition, a phenom- 
enon called hyperfine quenching” °°. Following ref. *” and using the 
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205+ oe 3H, 
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tTx0.7S w_ Experiment 
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0 1 2 3 4 5 6 


Total angular momentum, J 


Fig. 2|The 4d" ground state and relevant 4d°4fexcited electronic states of 
the’®’Re?* ion. Experimental result (shaded area) and theoretical values 
obtained using MCDHF approaches in two different implementations (MCDHF 
land 2) and aconfiguration-interaction calculation (Quanty). All error bars 
indicate louncertainty. The level structure is taken from the MCDHF1 
calculations. The lifetimes r of the levels of interest are denoted and the main 
decay processes (hyperfine-induced/quenched and ES) are stated. 


MCDFGME code™, we estimate a lifetime of 0.7 s, which is too short 
to be observed. The observed state must then be the next metastable 
level, 4€°4/°H,, which—neglecting quenching—can decay to the ground 
state only by an ES transition with a lifetime of ~239 d. Decays to the 
levels just below—that is, to the *Po, ?P, and °P, states—have very low 
probabilities because of their high multipolarities (M5, E4 and M3, 
respectively) and small photon energies. The 7H; state is also subject 
to hyperfine quenching; for example, its lifetime with a coupled atomic 
and nuclear angular momentum of F= 7/2 reduces to ~124 d. This cor- 
responds to a width-to-transition energy ratio of 3 x 10, enabling 
very-high-precision measurements. The states with F= 9/2 and F=11/2 
live 128 d, and other states coupled to more distant fine-structure levels 
have even longer lifetimes. 

Inthe Quanty calculations of the transition energies, the relativistic 
Dirac Hamiltonian includes the Coulomb and Breit interactions between 
all electrons. The Fock space is spanned by multi-Slater-determinant 
states constructed from relativistic Kohn—-Sham single-electron states, 
obtained from the density functional code FPLO”, using the local spin 
density functional of ref. *°. Within this calculation we explicitly include 
shells with principal quantum numbers n < n,,,, = 7. To extrapolate to 
Nmax =~, We compare results for calculations withn<7,n<6andn<5. 
Allowing quantum fluctuations within the n=4 shells, that is, including 
configurations with holes in the 4s—4d shells and additional electrons 
inthe 4fshell, we calculate the energies of the 4d°4fexcited statesina 
first approximation. Allowing for amaximum of three electrons inthe 
4f shell suffices to calculate the excitation energy with the required 
accuracy. Next, we extend the configuration space to allow electrons 
to scatter between the n= 4, 5 and 6 shells, where at most three holes 
are excited. By successively adding further configurations, we can 
extrapolate the excitation energy of the 7H, metastable state to be 
between 202.41 eV and 203.26 eV. 

In the MCDHF calculations, we optimize not only the mixing coef- 
ficients of the configurations but also the radial single-electron wave 
functions by self-consistently solving the corresponding radial Dirac 
equations. Quantum electrodynamics corrections to the excitation 
energy are estimated by the model potential method™ and by computing 
the self-energy shift of the 4d and 4f valence electrons using effective 
potentials, which account for screening by the core electrons. One of the 
MCDHF calculations (MCDHF1) employs MCDFGME code”, including 


full relaxation of all spectroscopic orbitals, as well as single and double 
excitations from all n=4 orbitals to the free single-electron states up to 
5d, with a result of 202.1(27) eV. In the other computation (MCDHF2), 
using GRASP2018>, we generate the set of configurations with single and 
double excitations from the n=4 states up to 9h, with the virtual orbit- 
als optimized layer by layer. Subsequently, the effect of triple electron 
exchanges is accounted for inaconfiguration interaction approach. The 
convergence of the results is monitored while systematically expand- 
ing the configuration set, allowing us to estimate the uncertainty of 
the calculations and providing an extrapolated value of 202.2(5) eV. 


Conclusion 


Figure 2 shows a comparison of the three theoretical values with the 
experimental one. As acrosscheck, we also performed frequency ratio 
measurements (with less statistics) of the same metastable state in 
Os?**, which is isoelectronic to Re?**. We obtained AF,, = 207(3) eV. As 
the electronic binding energy roughly scales with Z? (Z, atomic number), 
this higher energy is explained by the additional proton in Os (Z= 76) 
and the experimental result is again in agreement with the theoretical 
predictions, which are not explicitly given. 

In summary, we have demonstrated a method of determining the 
excitation energies of extremely long-lived metastable states in HCIs 
that yields realistic measurement times if the metastable fraction in 
the atomic orionic sample is at least 10% and the lifetime is longer than 
afew hours. In principle, with the precision achieved here for Re ions, 
it would also be possible to detect nuclear transitions with energies as 
lowas 5 eV if the abundance of the isomeric nuclear state in the ions is 
comparable to that of the ground state. 
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The recently discovered flat electronic bands and strongly correlated and 
superconducting phases in magic-angle twisted bilayer graphene (MATBG)’” crucially 
depend on the interlayer twist angle, 9. Although control of the global O witha 
precision of about 0.1 degrees has been demonstrated’ ”, little information is available 
onthe distribution of the local twist angles. Here we use a nanoscale on-tip scanning 
superconducting quantum interference device (SQUID-on-tip)® to obtain 
tomographic images of the Landau levels in the quantum Hall state’ and to map the 
local 6 variations in hexagonal boron nitride (hBN)-encapsulated MATBG devices with 
relative precision better than 0.002 degrees and a spatial resolution of a few moiré 
periods. We find a correlation between the degree of 0 disorder and the quality of the 
MATBGtransport characteristics and show that even state-of-the-art devices—which 
exhibit correlated states, Landau fans and superconductivity—display considerable 
local variation in 0 of up to 0.1 degrees, exhibiting substantial gradients and networks 
of jumps, and may contain areas with no local MATBG behaviour. We observe that the 


correlated states in MATBG are particularly fragile with respect to the twist-angle 
disorder. We also show that the gradients of 0 generate large gate-tunable in-plane 
electric fields, unscreened even in the metallic regions, which profoundly alter the 
quantum Hall state by forming edge channels in the bulk of the sample and may affect 
the phase diagram of the correlated and superconducting states. We thus establish 
the importance of 8 disorder as an unconventional type of disorder enabling the use 
of twist-angle gradients for bandstructure engineering, for realization of correlated 
phenomena and for gate-tunable built-in planar electric fields for device applications. 


Strong electronic correlations arise in twisted bilayer graphene when 
the low energy bands become exceedingly narrow in the vicinity of the 
magic angle’ ”, 6,,~1.1°. Initial estimates of the bandwidth of these flat 
bands assumed a rigid and uniform rotation between the two graphene 
sheets, leading to a moiré pattern” 8. Recent bandstructure calcula- 
tions have shown, however, that twist-angle relaxation within a single 
supercell (about 13 nm for @~1.1°), results in electronic reconstruction 
that substantially modifies the bandstructure». Because the band- 
structure of the flat bands is determined on a scale of several super- 
cells, twist angle gradients—similarly to the predicted strong effects of 
heterostrain’*”—should modify the single-particle bandstructure and 
induce symmetry breaking, possibly leading to properties that have not 
yet been considered. Moreover, as correlated phenomena may occur 
owing to electronic interactions on distances larger than the supercell, 
twist-angle variations may affect the stability of the competing orders, 
enriching the phase diagram of the correlated states. 

Scanning tunnelling microscopy studies have shown that the local 
twist angle can vary substantially in the same sample; these studies 
have also observed stacking faults and structural defects'**. Large 


inhomogeneities and extensive networks of stacking faults in bilayer 
graphene have also been observed by transmission electron micros- 
copy’”>’”, In this work, we provide high-resolution maps of @(r) in 
hBN-encapsulated MATBG devices (here r= (x, y) isthe position within 
the sample). The results reveal sizeable twist-angle gradients that con- 
stitute an unconventional type of disorder that strongly affects both the 
stability of correlated phases and magneto-transport characteristics 
in twisted bilayer graphene. 

We present here local studies of two samples (devices A and B, 
see Methods), fabricated using the ‘tear-and-stack’ technique*®*”’. 
Their global transport data show characteristic MATBG features’ ” 
including superconductivity, correlated insulator states at integer 
fractions of n, (four electrons per moiré supercell), and Landau fans, 
from which a global twist angle 6 = 1.06° is derived (device B, Fig. 1b). 
We derive local maps of @(r) by imaging the structure of the Landau 
levels throughout the sample. In a conventional quantum Hall state, 
alternating compressible and incompressible strips are formed near 
the sample edges, where the Fermi energy €; correspondingly resides 
within the Landau levels or in the energy gaps between them. These 
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Fig. 1| Comparison between the global and local quantum Hall signatures 
in flat and dispersive bands in MATBG. a, Experimental setup schematics 
with SOT scanning over MATBG (blue and magnified view) encapsulated in hBN 
(light blue). A voltage vic + Vj, is applied between the PdAu backgate and the 
grounded MATBG. The twist-angle gradient V6 induces an internal electric 
field and counterpropagating equilibrium quantum Hall topological /" and 
nontopological /"' currents in narrowincompressible (red) and wider 
compressible (blue) strips, respectively, flowing along equi-@ contours and 
detected by BY. Vi, causes the strips to oscillate in their position with 
amplitude r,..b, Global R,,. versus electron density n, and B, for device B, 
showing insulating states at integer fillings n./(n,/4), Landau fans and 
superconductivity.c, R,,(n,) at B, = 1.08 T (extracted from b along the dashed 


strips respectively carry counterpropagating nontopological (/*') 
and topological (/") equilibrium currents, as demonstrated recently 
in graphene’. By contrast, in MATBG these strips are found in the bulk 
of the sample instead of along the edges (Fig. 1a). 
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purple line). A global value @=1.06° is determined from the transport data. 

d, B% measured at a point in the bulk of device B versus n, at B, = 1.08 T. The 
sharp B* peaks reflect /" inincompressible strips with the sign determined by 
0,,, the magnitude by the energy gap of the Landau level and the separation by 
the Landau level degeneracy (red bars). The dispersive bands are shadedin 
yellow, the signal in the flat bands is amplified three times, and the p-band 
signal is multiplied by —1 for clarity. From the position of the B3° peaks the local 
twist angle is determined to be 8=1.053 + 0.005°. e, Calculated bandstructure 
with flat (white) and dispersive (yellow) bands indicated. Blue and red 
represent the two valleys. f, Magnified B2° peaks in the dispersive bands for 
device A at B, =1.19T, illustrating the procedure for determining the local n, and 
the corresponding local 6. The p-band signal is multiplied by -1. 


We image these currents using a superconducting quantum interfer- 
ence device fabricated on the apex ofa sharp pipette®’ (SQUID-on-tip, 
SOT; Fig. 1a). The Pb SOT, with a typical diameter d= 200 nm, is scanned 
at a height of hs9>; = 30 nm above the sample surface at 7=300 mK in 
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Fig. 2 | Structure of the Landau levels and derivation of the twist angle 
along aline scan. a, BY(x) versus V,, for device A acquired along the dashed 
line in Fig. 3a. The top axis denotes n./(n,(x)/4) for x= 0 and the separation 
between the yellow dashed lines describes the evolution of n,(x). The 


an out-of-plane magnetic field, B,~1T. We apply asmalla.c. excitation, 
Vig Onto the d.c. backgate voltage, vec. which causes a small a.c. dis- 
placement r,, of the position of the/" strips along the direction of the 
twist-angle gradient, VO(r) (see Fig. 1a. Henceforth, V = (0,, 0,) repre- 
sents the two-dimensional gradient in the x-y plane). The correspond- 
ing a.c. Biot-Savart magnetic field, B%’, is directly proportional to the 
local current density, eliminating the need for current reconstruction 
(see Methods). B8° shows a sharp peak whenever the narrow (approxi- 
mately 50-nm width) incompressible strips pass under the tip (Fig. 1a 
and Extended Data Fig. 4), providing very sensitive means for nanoscale 
imaging of the Landau levels. 

Figure 1d shows a sequence of these B* peaks versus the local carrier 
density n, for device B, acquired at a fixed SOT position, incomparison 
withthe corresponding trace of the longitudinal resistance R,, (Fig. 1c) 
at B, = 1.08 T. The position and magnitude of these peaks provide a 
wealth of information. Anincompressible quantum Hall strip appears 
at location r in the sample when the local carrier density precisely 
matches an integer number N of full Landau levels, |n.(r)| = ZN|B,|/®o, 
where gis the Landau level degeneracy and @)=h/e (eis the elementary 
charge, his Planck’s constant). Hence, the spacing An, between adjacent 
peaks reveals the degeneracy g of the Landau levels. The height of the 
B* peaks is proportional to/" =o,,Ae,/e (where o,,=ve’/his the quantum 
Hall conductance and v=gNis the integer filling factor), and thus reflects 
the energy gap between the adjacent Landau levels, Aé, = E\n+1 - Ein) 
where n= 0, +1, +2... is the Landau level index (see Methods). 

We start by inspecting high dopings, |n,| >n,, for which the Fermi 
level €, resides in the dispersive bands (yellow in Fig. le). Figure 1f pre- 
sents a magnification of the four lowest Landau levels inthe electron-like 
(n) and hole-like (p) dispersive bands for device A at B, = 1.19 T. The 
spacing between neighbouring peaks is An, = 1.15 x 10" cm’ =4B,/@p, 
showing that these Landau levels are fourfold degenerate. The spacing 
between the corresponding p and nLandau levels is 2(n,(r) +4M|B,|/@o), 
as illustrated in Fig. 1f. The/" peaks are very sharp, and so high-accuracy 
determination of the local n,(r) and thus of the local twist angle 
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dispersive-band regions are marked in yellow. The signal in the flat bands is 
amplified seven times and multiplied by —1 for p doping such that 
incompressible strips are bright. b-d, The derived position-dependent n,(x) 
(b), (x) (c) and the charge disorder 6n,(x) (d). 


O(r) = a.| /3.n,(1)/8 (a= 0.246 nm is the graphene lattice constant) is 
obtained with an absolute accuracy of +0.005° and a relative accuracy 
between different locations of +0.0002° (see Methods). In the 
two-dimensional (2D) scanning mode described below, we attain a 
sensitivity of 0.007° per Hz’? and provide 6(r) maps with relative accu- 
racy better than +0.002°. 

Instead of measuring at a fixed location, Fig. 2a shows B3°in device A 
acquired uponscanning the SOT along the white dashed line in Fig. 3a 
and sweeping V,,,, revealing that the Landau levels vary in space, form- 
ing rich patterns. Moreover, the degeneracy of the higher Landau lev- 
els toggles between fourfold and eightfold as a function of position, 
and a pronounced asymmetry between the Landau level structure in 
the nand p dispersive bands is observed. 

Asin Fig. 1f, by tracing the spacing between the lower Landau levels 
we derive the local n,(x) = C[V,.(x) — V_ns(x)]/2, where V,,(x) and V_,.(x) 
are the backgate voltages corresponding to the local filling of the flat 
bands |n.(x)| =n,(x) (dashed yellow curves in Fig. 2a), and Cis the back- 
gate capacitance (see Methods). The obtained n,(x) (Fig. 2b) varies by 
about 2.4 x10" cm, corresponding toa local variation in 0(x) of 3.9%, 
from 1.124° to 1.169° over the 2.7-um-long path (Fig. 2c). In addition to 
the twist-angle disorder, which shifts the p and n Landau levels antisym- 
metrically, we also derive the local charge disorder ng(x), which shifts 
all the Landau levels symmetrically through variation of the local charge 
neutrality point (CNP), g(x) = CVeyp(X) = [Vas(X) + V_y.(X) 1/2. The derived 
charge disorder 6ng(x) = ng(x) — Ny (Fig. 2d) has a standard deviation 
(s.d.) =0.8 x10" cm”, which is substantially smaller than the variation 
inn,(x), showing that the dominant source of disorder in this MATBG 
device arises from variations in 6(r), as evidenced in Fig. 2a by the 
antisymmetric bending of the dispersive p and n Landau levels. 

To derive full maps of the local twist angle 0(r) and the charge disor- 
der 5n,(r), we acquired B2°(r) (Supplementary Videos 1-4) by incre- 
menting V,, through the bottom of the dispersive bands. Figure 3a 
displays one frame from Supplementary Video 2 showing a large-area 
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Fig. 3| Mapping the twist angle and Landau levels in MATBG. a, B* image of 
the dashed area indicated in the inset at V,, =-16.4 V. Incompressible 
(compressible) quantum Hall regions are visible as bright blue to yellow (dark 
blue). Inset, AFM image of the hBN-encapsulated MATBG device A with edges 
outlined in white; the light-brown area indicates the underlying PdAu backgate 
and the bright spots show bubbles. b, 2D map of the twist angle 6(r) derived 
from tomography of Supplementary Video 1in the dashed region shownina. 
Grey-blue indicates regions that do not display MATBG physics owing to 
disorder (bubbles outlined in black) or to atwist angle that is substantially 


scan of device A (dashed rectangle in the inset), and Supplementary 
Video 1 presents a magnification of the central region (dashed rectan- 
gle in Fig. 3a). The red stripes reveal incompressible regions carrying 
I and the dark blue regions mark the compressible areas carrying the 
counterpropagating /". As V,, varies, the quantum Hall states move 
and change their shape in an intricate manner. Notably, the quantum 
Hall edge states are present in the bulk of the sample and do not flow 
parallel to the sample edges as would be expected. Moreover, large 
parts of the sample do not show Landau levels at all. These are the 
regions that are either highly disordered or may have atwist angle that 
is very different to the magic angle, with @ either close to zero or 8>1.5° 
such that the dispersive bands are reached at a V,, outside our range. 
Thus, the magic-angle behaviour appears only ina limited central 
region of the sample and does not fully extend to the edges. By com- 
parison, device B displays quantum Hall states over most of its area as 
revealed by the image of B*° in Fig. 3e. Supplementary Videos 3 and 4 
show the evolution of the Landau levels in the central part of the Hall 
bar structure (dashed rectangle in the inset) in the p and n dispersive 
bands, respectively. 

Using these data we generate a three-dimensional (3D) tomographic 
rendering of the Landau levels throughout the samples (see Methods), 
which can be inspected interactively online*’. Figure 3d shows a slice 
of the tomographic data of device A (see Supplementary Video 5), 
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different to the magic angle. The dashed area is presented tomographically in 
d.c,2D map of |V @(r)| showing patches of slowly varying 6(r) and a network of 
abruptjumps in 6. d, A slice from the tomography of device A showing 
disordered Landau levels in the bulk of the sample in the p dispersive band 
(Supplementary Video 5, and see ref. *° for aninteractive interface). Thexaxis is 
flipped for clarity. e-g, As ina-c, for device B. e was acquired at V,,=-15 Vandf 
is derived from tomography of Supplementary Videos 3, 4. h, Charge disorder 
map 6n,(r) of device B. i, Histogram of local 9 in devices A and B. The dashed 
lines mark the global 6 derived from transport measurements. 


revealing the layered structure of incompressible (light blue/red) and 
compressible (dark blue) quantum Hall regions. Notably, the Landau 
levels display steep slopes and numerous small jumps in the bulk of 
the sample, revealing that at any value of V,, (represented by a hori- 
zontal tomographic plane) several different Landau levels cross €, in 
the bulk of the sample, never forming a well defined single quantum 
Hall state. This observation explains the absence of clear conductance 
oscillations and quantization in the global R,,, data in the dispersive 
bands in Fig. Ic, despite the presence of fully developed Landau levels 
as observed locally in Fig. 1d. 

Applying the procedure of Fig. 1fto the tomographic data, we derive 
2D maps of the charge disorder (Fig. 3h; see discussion in Methods) 
and of the twist angle 6(r) in devices A and B (Fig. 3b, f). The grey-blue 
colour in Fig. 3b reflects areas where no quantum Hall states were 
detected within the measured span of V,,. These regions correlate with 
the locations of bubbles (black outlines) as revealed by atomic force 
microscopy (AFM) of device A (Fig. 3a, inset). Magic-angle behaviour 
is apparently absent within the bubbles and in their surrounding areas, 
up to 0.5 pm from the bubble edges. The Landau levels are also absent 
in additional regions where no particular features were observed by 
AFM. The map in Fig. 3b also shows that the magic-angle regions in 
device A do not create a percolation path between the contacts. This 
is consistent with our transport measurements, which do not show 
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Fig. 4| Internal electric fields, unconventional quantum Hall state and 
equilibrium currents induced by twist-angle gradients. a, Schematic 
representation of a MATBG sample with two connected regions of different 
twist angle, giving rise to an internal electric field £,.b, Schematic DOS in the 
two regions with differing n,. The backgate voltage V,,, imposes nearly the same 
electron density 7, (blue areas) in the two regions; however, the difference inn, 
results in different chemical potentials 1, and jp, giving rise to £, = Vyu/q. 

c-e, Finite element calculation of the potential V(x) = -y1(x)/q (c), the electric 


fully developed superconductivity, although correlated insulating 
states are present in this device. By contrast, in device B four-probe 
transport measurements showed high-quality correlated insulator 
states at multiple integer filling factors, and a zero-resistance super- 
conducting state (see Methods) consistent with the observation that 
the magic-angle area extends over the entire length of the central part 
of the device and shows a more uniform @(r) (Fig. 3f). 

The magic-angle regions show substantial twist-angle disorder 
(Fig. 3i, histogram). 0(r) spans a range of 0.13° (1.05° to 1.18° with 
s.d.=0.025°) in device A (Fig. 3b) and 0.10° in device B (0.98° to 1.08°, 
s.d. = 0.022°, Fig. 3f). Moreover, the topography of 0(r) is nontrivial, 
with numerous peaks and valleys as well as saddle points. Because the 
Landau levels in the dispersive band follow the bottom of the band, 
n= 807/(/3 a2), they appear first at the minimum of the @(r) landscape, 
which for device A occurs in the lower-right corner (dark brown in 
Fig. 3b). This behaviour is clearly visible in Supplementary Video 1, in 
which arc-like incompressible strips (bright) first appear at this corner 
and upon increasing |V,,| ‘climb’ the amphitheatre-like 0(r) landscape 
following the equi-@(r) contours. Similar behaviour is observed in other 
regions, with interesting dynamics occurring at the saddle points, as 
described in Methods. 

The @(r) derived in Fig. 3b, fis smooth with typical gradients of around 
0.05° pm. Figure 3c, g (which shows the gradient maps |V6(r)|), reveals 
that variations in 0(r) partially occur through a network of small steps of 
variable sizes, reaching amaximum step size of about 0.01°. The derived 
patterns strongly resemble the stacking-fault networks in bilayer gra- 
phene observed by transmission electron microscopy” ~’. These 
steps cause the stepwise jumps in the Landau levels that are visible 
in the tomographic view in Fig. 3d and Extended Data Fig. 8a, c. This 
finding implies that the smooth variations in 0(r) are accompanied by 
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field E(x) (d) and the carrier concentration 6n, =n,(x) — 7, (e) for the case of 
linear change in @from1.10° to1.15° with a 0.025° pm‘ gradient and an average 
carrier density 2,=3.25 x10” cm at B, = 0. f-i, Single-particle calculation of 
Landau level energies €,(x) inthe electron-like dispersive band (blue) and the 
self-consistent calculation of the chemical potential p(x) (red; f), E(x) (g),J"(x) 
(h) and &n, (x) (i) at B, = 1.22 T. Sharp peaks in F, and" correspond to narrow 
(about 40-nm wide) incompressible strips (shaded red) that are observed 
experimentally as peaks in Be. 


occasional small, abrupt changes across stacking faults that relax the 
tensile and shear stress. 

The revealed twist-angle disorder and gradients VO(r) may have 
substantial implications on the phase diagram and transport proper- 
ties of MATBG. Connecting regions of different 6 (Fig. 4a) is akin to 
connecting materials with different work functions (Fig. 4b), resulting 
in band-bending and creation of internal electric fields (Fig. 4c-e). 
The backgate voltage, V,,, imposes a nearly uniform carrier density, 
A. ~ CV. Locations with different 0(r), and hence a different density 
of states (DOS), translate this 2, into a different chemical potential 
u(r). At thermal equilibrium the Fermi level must be uniform, €; = 
pu+qV=0 (where the last equality reflects the grounding of the device; 
q=te is the charge of the carriers), and so variation in the chemical 
potential y(r) imposes band-bending—that is, it imposes variation in 
the electric potential V(r) = —y1(r)/q and creates an in-plane electric 
field E, = -V V that cannot be screened. Using the DOS derived from 
bandstructure calculations (see Methods), Fig. 4c-e presents a 
self-consistent numerical calculation of Vix), E(x) and 6n,(x) =n.(x) - Ne 
at B, =0 for the case of linearly varying 0(x) with V@=0.025° ym‘ com- 
parable to the measured average gradients in Fig. 3c, g. A large electric 
field £~0.4kV m ‘is formed in the region of varying 6(x), whereas the 
accompanying charge redistribution remains negligible, 
6n.(x)/fi, = +3 x 10°. Note that for Vg= 0, the twist-angle disorder has 
essentially no effect (Au(r) = 0 and £, = 0) and its impact grows with 
increasing |f,|. 

In the presence of a magnetic field, Landau levels are formed as 
depicted by the blue lines in Fig. 4f (see Methods). At high doping, 
|n.| >n,, gradients in 6(r)—and therefore in n,(r)—induce variation in 
the dispersive band occupation, |n,|-n,(r), because n,= CV,, is approxi- 
mately constant. As a result, an unusual quantum Hall state emerges 


Nature | Vol581 | 7May 2020 | 51 


Article 


in which instead of being restricted to the edges, the quantum Hall 
edge states are formed in the bulk, creating interlaced compress- 
ible and incompressible strips with different integer filling factors 
(Fig. 4g-i). This absence of a well defined quantum Hall state provides 
an explanation for the Shubnikov—de Haas (SdH) oscillations with- 
out full conductance quantization that are commonly seen in MATBG 
magnetotransport’”. 

In contrast with conventional quantum Hall behaviour, in which 
the edge states must form closed loops, here the edge states seem to 
terminate in the bulk upon reaching apparently disordered metallic 
regions (Supplementary Videos 1-4). Moreover, instead of the constant 
carrier density that is usually required in incompressible regions, inthe 
presence of a 0 gradient the density varies following the variation in 
n.(r) (Fig. 4i). V@(r) also causes accidental Landau level crossings (for 
example, indicated by the red circle in Fig. 4f) giving rise to occasional 
eightfold degenerate Landau levels in the dispersive bands, as observed 
in Figs. 1d, 2a and Extended Data Fig. 6. Figure 4g also shows large elec- 
tric fields (approximately 10° V m‘) formed inthe incompressible strips, 
giving rise to very narrow channels of persistent current /" (Fig. 4h), 
consistent with the experimental data (Extended Data Fig. 4g). The 
typical width of the channels (around 50 nm) along with the local |V@(r)|, 
determines the spatial resolution of our 6(r) mapping (see Methods). 

Finally, we discuss the rich structure observed in the flat bands in 
Figs. 1,2. In contrast to transport measurements that resolve SdH oscil- 
lations at high fields where some of the degeneracies may be lifted, 
we probe the Landau levels locally at relatively low fields. The zeroth 
Landau level at the charge neutrality point is apparently eightfold 
degenerate, followed by fourfold degenerate Landau levels on both 
sides (Fig. 1d). It has been argued that such degeneracy indicates break- 
ing of C,symmetry””, which may in turn be triggered by the observed 
6 gradients. Figures 1d, 2a show that these Landau levels are sometimes 
observed to extend beyond n./4 on both the pand nsides, although at 
other locations new, irregular Landau levels seem to emerge for n dop- 
ing above n,/4, as visible in Fig. 1d. The Landau levels clearly reappear 
above n,/2 for both dopings, showing a degeneracy of 2 (Fig. 2a and 
Extended Data Fig. 7). We occasionally observe single-fold Landau 
levels above 3n,/4 for both dopings, as seen in Fig. 1d and Extended Data 
Fig. 7. We also observe that the amplitudes of the /" peaks stemming 
from different integer fillings often follow a smooth envelope. Since 
the amplitudes of the /" peaks are proportional to the energy gaps Ae,, 
this signifies that the energy gaps between consecutive Landau levels 
are of similar (not alternating) magnitudes, indicating full lifting ofa 
degeneracy. Importantly, Figs. 1d, 2a and Extended Data Fig. 7 show 
that the Landau levels near n./4 and n,/2 are discontinuous and that 
those above 3n,/4 appear only at a few locations, indicating the extreme 
fragility of the correlated states to twist-angle disorder. 

Twist-angle disorder as we have described here is a previously unrec- 
ognized type of disorder. It is fundamentally different to other, more 
common disorders, owing to its ability to change the local bandstruc- 
ture and to induce large unscreened electric fields. The growth of its 
effect with the carrier density |n,| explains the higher visibility of the 
Landau fan near the charge neutrality point in transport measure- 
ments. The charge disorder in graphene is commonly characterized 
by the width of the resistance peak at the charge neutrality point. This 
width, however, bears almost no information on twist-angle disorder. 
Instead, our results suggest that the twist-angle disorder should be 
quantified by analysing the width of the resistive peak at n, and the vis- 
ibility of the Landau fans at |n,| > n,. Our finding that the quantum Hall 
state is strongly altered by the twist-angle gradient suggests that other 
correlated phases in MATBG— including magnetism and supercon- 
ductivity—may also be fundamentally transformed by the twist-angle 
disorder. The gate-tunable intrinsic in-plane electric fields generated 
by the twist-angle gradients may also be of practical importance for 
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photovoltaic and thermoelectric applications of atomically thin twisted 
van der Waals materials. 
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Methods 


Device fabrication 

The MATBG devices were fabricated using the previously reported ‘tear 
and stack’ technique”*”*’. We first exfoliated monolayer graphene and 
hBN of 10- to 50-nm thickness on SiO,/Si substrates, annealed at 350 °C 
(for hBN only) and selected using optical microscopy and AFM. Only 
flakes without wrinkles and bubbles were used. A PC/PDMS polymer 
stack ona glass slide mounted ona micro-positioning stage was used 
to pick up an approximately10-nm-thick hBN flake. The edge of the hBN 
flake was then used to tear a graphene flake. The substrate was rotated 
by 1.1° to 1.2°, followed by pickup of the other piece of graphene. The 
resulting stack was encapsulated with another hBN flake of thicknesses 
of 30-70 nm that had been put onto a metallic gate made of evaporated 
Cr/PdAu. The device geometry was defined by electron-beam lithogra- 
phy and reactive ion etching, keeping only the relatively clean regions. 
Electrical contacts to the MATBG were made by the one-dimensional 
edge-contact method. 

Optical images of devices A and B are shown in Extended Data 
Fig. 1a, b, respectively. Device A was fabricated on a degenerately doped 
Si substrate with 300-nm SiO,. The MATBG resides partly on SiO, and 
partly on the evaporated metallic backgate (light brown in Extended 
Data Fig. 1a). In this work, only the metallic backgate has been used for 
varying the carrier concentration n, and a constant voltage Vin =50V 
was applied to the Si backgate for keeping the rest of the sample con- 
ductive during the transport measurements. Device B was fabricated 
onanintrinsic Si substrate with a metallic backgate extending over the 
full size of the device (light blue in Extended Data Fig. 1b). 


Transport characteristics 

Four-probe resistance measurements of the samples at 7=300 mK are 
shown in Extended Data Figs. 2, 3. Both devices exhibit the common 
transport characteristics of correlated physics in MATBG! °°”, includ- 
ing R,,. peaks at n, and its integer fractions, and Landau fans at elevated 
magnetic field. The slopes of the Landau fans in Extended Data Fig. 2a, b 
were used to extract the backgate capacitances C of 3.07 x 10" cm? V7 
(49.23 nF cm”) for device A and 2.31x 10". cm? V 1(37 nF cm”) for deviceB, 
consistent with the evaluated dielectric thickness of the underlying hBN. 
The origins of the Landau fans were used to derive the global n, and the 
corresponding global @=1.15° for device A and 9 =1.06° for device B, in 
good correspondence with histograms of the local twist angle in Fig. 3i.In 
device A the global 8 correlates with the average of the @(r) distribution, 
whereas in device B it is close to the upper end of the distribution func- 
tion. Consistently, the four-probe transport measurements in device B 
probe the central part of the Hall bar structure (Extended Data Fig. 1b), 
where O(r) is at its highest and is considerably more uniform (Fig. 3f), 
whereas the lowend tail of @(r) distribution arises from regions that are 
not probed by transport measurements. 

In addition, in device B we observe the superconducting state in 
the vicinity of p-doped n,/2 with zero R,,; this superconducting state 
becomes suppressed by small magnetic field (Extended Data Fig. 3b). 
The critical current inthe superconducting state reaches about 100 nA, 
as determined by the differential dV/d/ characteristics (Extended Data 
Fig. 3c), and depends sensitively on the carrier density n,. The observa- 
tion of fully developed superconductivity in device B is consistent with 
the finding of a continuous magic-angle region between the voltage 
contacts in Fig. 3f. Suppression of the resistance was also observed in 
device A (Extended Data Fig. 3a), but the lowest R,,.was 328 O, sugges- 
tive of the presence of some superconducting regions but absence of 
a percolation path between the voltage contacts, consistent with the 
6 map in Fig. 3b. 


SOT fabrication and characterization 


The Pb SOTs were fabricated as described in ref. ® with diameters rang- 
ing from 220 to 250 nm, and they included an integrated shunt resistor 


onthetip®. The SOT readout was carried out using a cryogenic SQUID 
series array amplifier*® **. The magnetic imaging was performedina*He 
system” at 300 mK, at which the Pb SOTs can operate in magnetic fields 
ofupto1.8T. At the fields B, ~1.2 T usedin this study, the SOTs displayed 
flux noise down to 250 n@, per Hz!” (where @, is the superconducting 
flux quantum, ®, = h/2e = 2 x 10 Gcm”), spin noise of 101, per Hz!” 
(uz, Bohr magneton), and field noise down to 10 nT per Hz”. For height 
control, we attached the SOT to a quartz tuning fork, as described 
in ref. *°. The tuning fork was electrically excited at the resonance 
frequency of ~33 kHz. The current through it was amplified using a 
room-temperature custom-built trans-impedance amplifier, designed 
based on ref. “ and measured using a lock-in amplifier. The scanning 
was performed at a constant height of 20 to100 nm above the surface 
of the top hBN. 


Technique for direct imaging of the current and evaluation of 
the current inthe incompressible strips 

To avoid the 1/f noise of the SOT that is present at frequencies below 
~1kHz, ana.c. signal from backgate modulation was acquired instead 
of measuring the local d.c. B,(r). We applied a small a.c. excitation to 
the backgate (Fig. 1a), Vig= Vig + Vie Sin(2nft), where f~3 kHz, andthe 
corresponding B> = Vi,,0B,/0V,, was then measured by the SOT using 
alock-in amplifier. Another major advantage of this modulation is that 
it provides a convenient method for direct imaging of the local current 
density /(r). To demonstrate its principle, consider a 0 gradient in the 
x direction that gives rise to anarrow strip of current of width Ax posi- 
tioned at xo, and carrying a current density /, in the y direction with a 
total current /, = Axj, (Extended Data Fig. 4a). The magnetic field B,(x) 
generated by the current and measured at a height h,o; above it is 
described by the Biot-Savart law (Extended Data Fig. 4b). For heights 
Asor> Ax, B(x) is largely governed by only the total current/,in the strip, 
independent of Ax. B,(x) is an antisymmetric function with a steep 
slope above the current strip. Its spatial derivative 0B,/Ox has a sharp 
peak at the strip location (Extended Data Fig. 4c), with a height pro- 
portional to /,, and thus can provide a good means for direct imaging 
of the current-density distribution J,(x) if this distribution can be 
modulated in space in thex direction. The backgate voltage V5, provides 
this spatial modulation, as follows. In the presence of potential gradi- 
ents, the quantum Hall edge channels flow along equipotential contours 
(given by equi-6 contours in the absence of charge disorder). A small 
Vie thus shifts the location of the channel by xq = Vig0Xo/OVpq in the 
direction parallel to the gradient and perpendicular to the current flow. 
So regardless of the gradient direction X, the measured signal will be 
given by B2° =- xf 0B,/Ox~ x3J, (x), thus providing direct imaging of 
the local current density. Extended Data Fig. 4d-f presents a simulation 
of three counterpropagating current strips demonstrating the B2° 
imaging for this case. 

The sharpness of the B%° peak is determined by Ax, hso7, x6 and the 
SOT diameter. In Fig. 2a, the scanning height above the MATBG is 
Aso = 70 nm (including hBN) and the effective SOT diameter is 220 nm; 
hence the spatial resolution is largely determined by the SOT diameter. 
For these parameters and a root mean square (r.m.s.) x§ =54nm, 
Extended Data Fig. 4c shows that a current strip that is narrower than 
~150 nm will result in a resolution-limited B%° peak (compare the solid 
and dashed lines). 

We nowuse this numerical procedure to analyse the data in Fig. 2a, 
which presents the B% signal along the white dashed line in Fig. 3a; this 
dashed line is aligned along the twist-angle gradient in a region of 
largely smooth @(r) behaviour (light brown region in the top-left corner 
of Fig. 3b, see also Supplementary Video 1). Extended Data Fig. 4g pre- 
sents an example of the BY data from Fig. 2a at V,,=—10.54 V displaying 
the BY peak (blue). The data at consecutive V,, values show that the 


peak position x, shifts with V,, at a rate of 0x,/0V,, = 1.54 um V", cor- 


responding to a twist-angle gradient ps = cia = 0.0374° pm 
Ss 
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a 
(where 75 ot: 6 2 The data were acquired with anr.m.s. voltage, 
Vig= 35 mV, mica anr.m.s. xg = Vig we =54nm. The red curve in 


Extended Data Fig. 4g shows a numerical fit to the data using these 
experimental parameters and acurrent-strip width of Ax=50 nm, result- 
ing in/'’ =1.3 pA. The good fit shows that the experimental results are 
consistent with our simulations presented in Fig. 4f-i, although the 
exact value of Ax cannot be determined because the experimental B° 
peak is resolution-limited by the diameter of the SOT. 

The extracted value /" =1.3 /A is not sensitive to precise Ax. Because 
I=0,,Aé,/e = vAe,e/h, and v= 12 in Extended Data Fig. 4g, we obtain 
Aé, = 2.8 meV, comparable to the values derived from bandstructure 
calculations (Extended Data Fig. 10a). Using Ax = 50 nm we obtain 
(in the incompressible region) an in-plane electric field of E, = Ae,/ 
(eAx) = 56 kV m ‘comparable to the simulation values in Fig. 4g. 


Topological and nontopological currents in the incompressible 
and compressible quantum Hall strips 
Gradients in the twist angle V@ give rise to gradients in the chemical 
potential Vz and to alternating compressible (when zresides withina 
Landau level) and incompressible (win the energy gap between Landau 
levels) quantum Hall strips (Fig. 4f-i). Both regions carry current”; 
however, usually only the currents in the incompressible strips, J' =0E, 
which are of topological nature, are considered, whereas the nontopo- 
logical currents in the compressible strips, J‘" = HV * |n,|z, are com- 
monly ignored (here pz, = €,/B is the magnetic moment of the orbiting 
electron and ¢, is its kinetic energy)**. The following semiclassical 
picture is instructive in describing J' and J". Under strong magnetic 
fields and in the absence of in-plane electric fields, the charge carriers 
follow cyclotron orbits that can be described semiclassically as an array 
of circles, resulting in zero average bulk current (Extended Data Fig. 5a). 
Applying an external in-plane electric field along the x direction to an 
incompressible state (F,=—0V/0x) causes the circular orbitals to convert 
into spirals drifting along the y direction, generating a current J), =06, 
(Extended Data Fig. 5b). Alternatively, applying the same exicrnalelee- 
tric fieldto acompressible strip will result in carrier redistribution that 
screens thein-plane electric field. As a result the drift current vanishes, 
but at the cost of a non-zero gradient in the carrier density, 0n,/Ox 
(Extended Data Fig. 5c). Because each orbital carries a magnetic 
moment pt, = 11,2, giving rise to a local magnetization m = |n,|f,, the 
induced 0n,/0x causes gradients in m, and hence produces equilibrium 
currents through” J‘' = V x m. This accounts for a non-zero 
JS" = HAN. |/Ox (cyan arrows in Extended Data Fig. 5c), which flows in 
the direction opposite to the topological current J" in Extended Data 
Fig. 5b. Since a full band does not contribute to current, n, inthe above 
expression refers only to carriers ina partially filled band. Alternatively, 
JN™ canbe understood as arising from uncompensated contributions 
tothe current from neighbouring orbitals in the presence ofa gradient 
in the orbital density (Extended Data Fig. 5c). 

The total current carried by the drifting orbitals in an incompressible 
strip residing between two compressible regionsis given by/}= eee dx= 

0,,AE,/e, where AE, = Eyqa1 — En iS the Landau level energy gap between 
ea compressible states and o,, = ve’/h is the quantum Hall con- 
ductance of the incompressible state (see Fig. 4). Fora more extensive 
description of J’ andJ™' see ref. °. 


Determination of twist-angle measurement accuracy and 
spatial resolution 

@ accuracy. The local twist angle is determined by the local n,(r) via 
6(r) =a,| /3n,(r)/8. The incompressible / current and the correspond- 
ing peak in the B% signal appear at specific locations where NLandau 
levels inthe dispersive bands (N=1, 2, 3,...) are exactly fully occupied, 
corresponding toa density |ny|= C Voq~ Vesela, +4N|B,|/@o for four- 
fold degenerate Landau levels, where! Vie is the backgate voltage that 


corresponds to the Nth peak. Measuring the N and N' peaks in the p 
and n dispersive bands, respectively, allows derivation of n,(r) = 
c(\viy + |\Veul)/2- 4N\B,|/@, and therefore of O(r). The absolute aucle 
accuracy is thus determined by the accuracy of C, B, andVy . Determin- 
ing Cis possible through global transport measurements and more 
accurately through local measurement of the spacing between any two 
incompressible peaks V jz '— V},=81B,1/(oC), where gis the degener- 
acy of the Landau level considered: From this we estimate our overall 
absolute accuracy of determining n, to be about +1%, thus giving an 
absolute @ accuracy of 56 =+0.005°. 

In this study, however, we are particularly interested in the relative 
accuracy of @(r) for comparing different locations r and deriving the 
angle gradients V@. The relative accuracy is determined primarily by 
the measurement precision of Vin. The sharpness of the /" peaks and 
the good signal-to-noise ratio of the B%° signal allow high-precision 
measurement of V;:% z aS demonstrated in Extended Data Fig. 6. In the 
stationary measurement in Extended Data Fig. 6a (magnification of 
Fig. 1f), V.g was swept with increments AV,, = 4.7 mV, demonstrating 
that the Vig and Viz Peak positions can be determined to anaccu- 
racy better than +AV,,, (one step size), corresponding to 

Vig ‘/Vig (4x10 * 0%. /n,, and so we have 60/0 ~ 2x 10%, ora 
relative @ accuracy of 58=+0.0002°. In Supplementary Video 1 (which 
was used to construct the full 6(r) map of device A; see Fig. 3b), larger 
increments AV,,=40 mV were used (Extended Data Fig. 6b), corre- 
sponding to a @ accuracy of 88 =+0.002°. Supplementary Videos 3, 4 
used AV,,=45 mV constructing the 0(x, y) map of device B (Fig. 3f) with 
similar accuracy. Supplementary Videos 3, 4 contain 87 frames of 
68 x 184 = 12,512 pixels each, which were acquired over a total of 42 h. 
The V,, trace of each pixel therefore took t = 12 s to acquire. The 
66<+0.002° accuracy, normalized by the pixel acquisition time, indi- 
cates that the relative 6 sensitivity per pixel in the imaging mode is 
better than Si? = ./f&0=0.007° per Hz”. 


Spatial resolution of 6(r) mapping. Our electrostatic simulations 
show that the typical width of the incompressible /' strips is about 
50 nm (Fig. 4f, h) and should be smoothed by the wavefunction width, 
of the order of magnetic length /, = ./A/eB ~ 25 nm. The position r of 
the incompressible strip provides a very accurate determination on 
the local n,(r) and @(r), and so the width of the strip essentially deter- 
mines the spatial resolution, which can be smaller than the SOT diam- 
eter. The actual spatial resolution dris determined by the accuracy 5V,, 
to which the V,, value can be assigned to the /" peak ata location r, 6r= 
5V,,0r/0V,,, where 0r/0V,, is the change in the position of" per change 
in V,,.Since/' appears at V,,(r) =[n,(r) + 4N|B,|/ol/C, the space depend- 
ence enters only through n,(r) = n,(@(r)), and thus 0r/0V,,= c(06/dr)* 
(0n,/06) 1, where 0n,/00 = 166/\/3 a. Using characteristic values C= 
2.5 x 10" V' cm”, 6V,, = 45 mV in the scanning mode and 
00/dr=0.05° um‘ gives a resolution Sr=50 nm. Smaller 06/0r gradients 
result in larger 6r. However, in sucha case, a lower spatial resolution is 
required because @ varies slowly in space. The estimated 5ris compa- 
rable to the pixel size in the videos (57 nm in Supplementary Video 1 
and 43 nmin Supplementary Videos 3, 4). We thus conclude that the 
spatial resolution 5r of the attained O(r) maps is of the order of 4-5 
moiré supercells (13 nm each). 


Local quantum Hall measurement in device A 

Extended Data Fig. 7 presents the local B2° measurement with the SOT 
at a fixed position, along with the global transport R,,. measurement 
in device A at B,=1.19 T. Alternating compressible and incompressible 
states in the region under the tip leads to a series of peaks in B2°, with 
sharp peaks corresponding to incompressible strips carrying ii The 
sign of the incompressible peaks is determined by the sign of o,,, with 
B% > 0 (B°< 0) for electron (hole) doping. In Figs. 1d, 2a the B% signal 
for p doping is multiplied by —1 for clarity. The spacing between adjacent 


peaks reflects the degeneracy of the Landau level. The dispersive band 
(shaded yellow), exhibits asequence of fourfold and eightfold degen- 
eracies. In the flat band we find fourfold degenerate levels around n,=0, 
twofold degeneracy near n, = +n,/2, and 1-fold degenerate levels near 
n, = —3n,/4 (see Fig. 1d for 1-fold degenerate levels near n, = +3n,/4). 
Evaluation of the local n, allows the extraction of the local twist angle, 
0=1.136 + 0.005°, as described below. In contrast to the sharp local B3° 
peaks, oscillations in R,,,are barely visible, owing to @(r) disorder and 
the fact that the magic-angle regions in device A do not extend over 
the entire device area. 


Landau level tomography and twist-angle mapping 

To map the local twist angle, a series of B2°(r)area scans were performed 
while varying V,,. This results in a 3D dataset with two spatial dimen- 
sions and one V,, (or equivalently, n,) axis. Each energy gap between 
adjacent Landau levels forms a 2D manifold in this 3D space with a peak 
inthe B% signal (bright regions in Extended Data Fig. 8). The manifolds 
of the lowest Landau levels in the dispersive bands trace the manifold 
of the bottom of the dispersive band, n,(r), and are displaced vertically 
fromit by the degeneracy of the Landau levels, thus providing the means 
for mapping the local n,(r) and hence the local 0(r) = a././3.n,(r)/8. The 
3D space was mapped with pixel size of -50 nm and V,, spacing between 
successive scans, AV,, ~ 40 mV, which enables us to map @(r) with an 
accuracy of 88 =+0.002° (see Methods). 

For device A, the tomographic imaging was acquired for the p dis- 
persive band for V,, spanning —8.58 V to -11.50 V with AV,,= 40 mV 
(Supplementary Video 1). The spacing between adjacent fourfold lev- 
els at B,=1.22 T was 0.39 V~10AV,,. In this device, the spatial variation 
of the charge neutrality voltage Venr(r) was found to be very small 
(Fig. 2a) and therefore n,(r) was derived from the 3D data assuming a 
constant Vinvs Representative slices of the 3D dataset are shown in 
Extended Data Fig. 8a, b. At V,, = —8.5 V, the Fermi level resides in the 
flat band for all points in space, and at V,, =-11.5 V, €, isin the dispersive 
band. Asé, moves through the bottom of the dispersive band, it crosses 
four fourfold degenerate Landau levels above n,, followed by an eight- 
fold degenerate Landau level. The black line in Extended Data Fig. 8a 
traces the N=—4 incompressible /" peak revealing gradients with occa- 
sional small jumps in the twist angle. We note that at thejump positions 
the intensity of the signal is suppressed, owing to pinning of the Landau 
levels at the steps, which reduces the amplitude of the spatial a.c. dis- 
placement x% and hence the intensity of B° (see Methods section 
‘Technique for direct imaging of the current and evaluation of the cur- 
rent in the incompressible strips’). 

We note that at any value of |V,,,| > 2,/C several different Landau lev- 
els cross €; in the bulk of the sample. Thus, despite the fact that fully 
developed Landau levels are present locally, no well defined quantum 
Hall state in the dispersive bands can be observed globally. The same 
is true for the Landau levels of the correlated states, which are also 
influenced by variations in n,(r). This explains why MATBG magne- 
totransport commonly shows SdH oscillations without displaying 
full conductance quantization, with the exception of the CNP Landau 
fan’. At high enough field, the quantum Hall quantization should be 
recovered when the Landau level degeneracy 4B,/@, exceeds the n,(r) 
variations. 

Device B exhibited stronger charge inhomogeneity and hence the 
3D tomographic imaging was acquired for both the p and n disper- 
sive bands (Supplementary Videos 3, 4) and n,(r) was derived from 
the separation between the corresponding Landau levels in the two 
bands as described schematically in Fig. 1f. The tomographic data of 
both samples has been published online”®. 

The observation of superconductivity in devices with twist-angle 
disorder of ~0.1° may be explained by either a tolerance of the supercon- 
ducting state to the exact 0 or by percolating paths along very specific 
6,,. Both of our devices show superconductivity—despite having onlya 
small overlap in their histograms in Fig. 3i-which supports the former 


explanation. Figure 2a shows, however, that Landau levels near n,/4 
and n,/2 are discontinuous and those above 3n,/4 appear only at a few 
locations, indicating the extreme fragility of the correlated states to 
twist-angle disorder. 


Mapping of the charge disorder 

Similarly to the mapping of the twist angle disorder through n,(r) = 
CLV,.(r) — V_,.(r)]/2, the tomographic imaging also allows mapping of 
the charge disorder 6n,(r) = ng(r) — Ng, where n,(r) = C[V,,(r) + 
V_,s(') 1/2, as presented in Fig. 3h for device B. Extended Data Fig. 9 
shows the histogram of 6n,(r) along with a Gaussian fit with a standard 
deviation An, = 2.59 x 10" cm”, which is comparable to high-quality 
hBN-encapsulated monolayer graphene devices“ and substantially 
lower than in graphene on SiO,*. Note that in contrast to 
hBN-encapsulated graphene, the MATBG fabrication process is cur- 
rently incompatible with thermal annealing procedures for disorder 
reduction. We observe that the charge disorder in device B is notably 
larger than in the magic-angle regions in device A (Fig. 2d), which we 
ascribe to the fact that in contrast to device A, device B did not undergo 
surface residue cleaning by AFM. 

Note that the tomographic method allows mapping of the twist-angle 
and charge disorders only in the magic-angle regions, where Landau 
levels are present. In device A, a large part of the sample did not show 
magic-angle physics (Supplementary Videos 1, 2 and Fig. 3a, b), whereas 
the magic-angle regions revealed very low charge disorder with an esti- 
mated standard deviation Ang~1.3 x 10 cm “as attained by several 1D 
scans, suchas in Fig. 2a, d. We therefore performed tomographic imag- 
ing of only the p dispersive band, which does not permit extraction of 
the full 2D map of 6n,(r) in device A. Neglecting this low level of charge 
disorder introduces an error in the derived 0(r) map of device A of 
60 <0.0015°, whichis negligible compared to the span of 0(r) in Fig. 3b. 


Bandstructure calculations and Landau level crossings 
The bandstructure of twisted bilayer graphene can be computed from 
an effective continuum Hamiltonian””*° *: 


Hx H® ut 
U_ HY? 


where H® is the valley-dependent monolayer graphene Hamiltonian 
for layer i: 


H® =-nu(k-K®) - (0,,0,) 


with Fermi velocity v,, €=+1 indicating the positive and negative valleys, 
K the k-space location of the respective Dirac points in layer i, and U 
the interlayer coupling***: 


ya(4 W\,( ua wo \iccpr,( U U’O)jeeres- 
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Here, u = 0.0797 eV and uw’ = 0.0975 eV are coupling constants* that 
give the strength of the interaction between like (A = A, B= B) and 
opposing (A < B) sublattices in the two layers, the difference of which 
accounts for out-of-plane corrugation, w=e?"?, and U'isthe Hermitian 
conjugate of U. The moiré reciprocal lattice vectors, Gj'= a - a), are 
given by the difference between the reciprocal lattice vectors in the 
upper (a) and lower (a) layers. 

Magnetic field effects can be included by making the substitution 
k>k+ eA/hinthe effective Hamiltonian. Here, Ais the vector potential, 
whichis related to the static magnetic field via B= V x A. In general, the 
bandstructure in a magnetic field cannot be computed because the 
addition of a spatially dependent vector potential breaks translational 
invariance. However, at certain values of the magnetic field—specifi- 
cally when SB/(h/e) = p/q, where p and q are co-prime integers and Sis 
the area of the unit cell—a ‘magnetic’ unit cell can be introduced 
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whereupon it becomes possible to solve the Schrédinger equation 
using the corresponding ‘magnetic’ Bloch conditions”. It is then pos- 
sible to construct a Hamiltonian matrix in the basis of the monolayer 
graphene Landau levels***. Although the Landau levels basis is 
unbounded, the Hamiltonian matrix can be truncated at an energy 
where the higher-energy Landau levels only weakly affect the 
low-energy spectrum. This cut-off energy must be much larger than 
the interlayer coupling characterized by the coupling constants uand 
u’. The resulting finite matrix can then be diagonalized. This results in 
a bandstructure diagram in terms of p/q that is directly related to the 
strength of the magnetic field and indirectly related to the twist angle, 
because the moiré unit cell area, S=./3 a?/[8sin?(6/2)], depends on @. 
For avarying magnetic field or twist angle the bands are computed for 
each individual parameter value assuming that these values are homo- 
geneous throughout the material. 

Level crossings in the bandstructure are observed as the mag- 
netic field or the twist angle are varied. These occur as a result of the 
‘Rashba-like’ splitting of the dispersive bands. In general, this type of 
splitting leads to two Landau level series, largely overlapping in energy, 
which cross asa function of magnetic field (Extended Data Fig. 10a), 
and as a function of 6 (Extended Data Fig. 10b), as is the case in the 
experimental data. These are due to the evolution of the Rashba-like 
splitting with 6 (Extended Data Fig. 10c-e). 


Origin of the internal in-plane electric field and numerical 
electrostatic simulations 

The origin of the internal electric field in presence of twist-angle gra- 
dients can be understood intuitively as follows. An external in-plane 
electric field applied to graphene—for example, by charge disorder in 
the substrate—exerts a force on the electrons. Thermal equilibrium 
conditions require zero net force on the carriers, and so charge redis- 
tribution will occur, creating an opposing electric field and leading to 
screening of the external field, thus achieving the required zero net 
force. This is the common situation in metals. In the case of MATBG, 
the backgate voltage induces a nearly uniform carrier density, while the 
twist-angle disorder induces a variable chemical potential u(r) (Fig. 4b), 
which exerts an in-plane force on the carriers, F, =—Vy. To attain zero 
net force in thermal equilibrium, an in-plane electric field E, = Vu/q 
must therefore be generated by carrier redistribution. Thus, in con- 
trast tocommon charge disorder, in which the system tends to screen 
external electric fields, in the case of twist-angle disorder the system 
counterintuitively spontaneously generates internal electric fields to 
counterbalance the force produced by the variable chemical potential. 
These internal fields do not generate current at zero magnetic field; 
however, ina finite magnetic field transverse topological and nontopo- 
logical currents will be induced in the ground state. 

For the results presented in Fig. 4, simulations (using the COMSOL 
Multiphysics analysis software) were used to solve electrostatic 
equations for the potential Vand charge density p =—en, at B, =O and 
in the quantum Hall state at B, = 1.22 T. The simulations included a 
backgate electrode at aconstant electric potential V,, and a grounded 
MATBGina3 x 0.5 um’x-zbox, assuming translation invariance along 
the yaxis, with boundary conditions of E, =O onthe external surfaces 
of the box. An iterative self-consistent solution for V(x, y, z) and 
p(x, y) was obtained, satisfying the following conditions: (1) Vdepends 
on p through V-E = p/€,€) and E = -VV, where €, is the relative 
permittivity (we took ¢,=4 for hBN) and €)is the vacuum permittivity 
(here V = (0,, 0,, 0,) is the three dimensional operator); and (2) p 
depends on V through the integrated DOS N(u;6), where 
u(x, y) =-qV(x, y) and g= +e is the carrier charge (negative sign for 
p> 0). The integrated DOS \V.(u; 6) was calculated for B=0 and B#0 
as described in the Methods section ‘Bandstructure calculations and 
Landau level crossings’. 

Once V(x, y,z=0) and p(x, y) were found in the plane of the MATBG, 
the incompressible surface currents were calculated using J' =—oVV, 


where o,,(x, y) =—0,,.(x, y) =-v(x, y)e’/h, and 0,,,= 0,,= 0 are the compo- 
nents of the conductivity tensor o. 


Measurement parameters 
Allthe measurements were carried out at 7=300 mK in an out-of-plane 
applied magnetic field B,. 


Figure 1b and Extended Data Fig. 2b. Device B, /,-=10 nA (r.m.s.), 
Vig=—15 Vto lS V. 


Figure 1d. Device B, B, = 1.08 T, SOT diameter 250 nm, scan height 
40 nm, Vi, =20 mV (r.m.s.), AV,, = 6.25 mV, acquisition time 6 s per 
point, total acquisition time 8 h. 


Figure 1f and Extended Data Figs. 6a, 7. Device A, B, = 1.19 T, SOT di- 
ameter 220 nm, scan height 100 nm, Vig =15mV (r.m.s.), Vig=4.7 mV, 
acquisition time 6s per point, total acquisition time 12h. 


Figure 2. Device A, B, =1.22 T, SOT diameter 220 nm, scan height 60 nm, 
Vie=35 mV (r.m.s.), pixel size 26 nm, 160 ms per pixel, total acquisition 
time 21.4h. 


Figure 3a and Supplementary Video 2. Device A, B, = 1.16 T, SOT di- 
ameter 220 nm, scan height 110 nm, bg =80 mV (r.m.s.), pixel size 
60 nm, 60 ms per pixel, acquisition time 60 min per frame. 


Supplementary Video 1 and Extended Data Figs. 6b, 8a, b. Device A, 
B,=1.22T, SOT diameter 220 nm, scan height 60 nm, bes 35 mV (r.m.s.), 
pixel size 57 nm, 60 ms per pixel, acquisition time 30 min per frame. 


Figure 3e. Device B, B, = 1.08 T, SOT diameter 250 nm, scan height 
140nm,V5,=60 mV (r.m.s.), pixel size 50 nm, 60 ms per pixel, acquisi- 
tion time 33 min. 


Supplementary Videos 3, 4 and Extended Data Fig. 8c, d. Device B, 
B,=1.08T, SOT diameter 250 nm, scan height 70 nm (Video 3) and 80nm 
(Video 4),Vi,= 60 mV (r.m.s.), pixel size 43 nm, 60 ms per pixel, acqui- 
sition time 25 min per frame. 


Extended Data Fig. 2a. Device A, /,.=10 nA (r.m.s.), Vig=—17 V to 17 V, 
Si backgate 50 V. 


Extended Data Fig. 3a. Device A, /,,=5nA (t.m.s.), Vig =—6 V to -3.5 V, 
Si backgate 50 V. 


Extended Data Fig. 3b. Device B, /,.=4 nA (r.m.s.), Vig=-7 V to -4 V. 


Extended Data Fig. 3c. Device B, /,.=10 nA (r.m.s.), Vig =—6.5 V to 
-4.7V,B,=OT. 


Data availability 


The data that support the findings of this study are available from the 
corresponding authors on reasonable request. 
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device A device B 


Extended Data Fig. 1| Optical image of MATBG devices. a, Optical image of measurements. b, Optical image of device B (cyan) on the PdAu backgate (light 
device A showing hBN/MATBG/hBN (green), the underlying PdAu backgate blue) with marked electrodes. 
(light brown) and the marked electrodes used for four-probe R,, 
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Extended Data Fig. 2| Transport measurements at T=300mK.a,Four-probe Thegreensolid lines showthe segments that can be traced in the data and the 
measurement of R,,(V,,) versus B, in device A using an excitation current of dotted lines indicate their extrapolation to the origin. b, Asina for device B. 
10 nA with the corresponding traces of the Landau fan diagram at the bottom. The purple colour marks the regions where the R,, signal was slightly negative. 
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Extended Data Fig. 3 | Transport measurements inthe superconducting (black) is observed in device B.c, dV/d/ versus /,. characteristics at various 
state at T=300 mK. a, b,Colour rendering of R,,. measured in the vicinity of carrier concentrations n, inthe superconducting state in device Bat B, =OT 
-n,/2 versus B, and n, at low fields using anr.m.s. excitation current of 5nAin using anr.m.s. a.c. excitation /,,=10 nA. 


device A (a) and 4 nA in device B (b). A zero-resistance superconducting state 
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Extended Data Fig. 4| Numerical simulation demonstrating current 
imaging by measuring B*. a, Current distribution/,(x- Xp) ofa 
Ax=50-nm-wide channel carrying /,=1 pA in the direction. b, Calculated 
-BAx-Xp) at a height of 70 nm above the sample, convoluted witha 
220-nm-diameter SOT sensing area. c, Calculated B3°(x — x9) for anr.m.s. 

x =54nm spatial modulation of the channel position. The dashed profile 
corresponds toacurrent strip of width Ax=150 nmcarrying the same current, 
showing that the spatial resolution is limited by the SOT diameter. d-f, Asin a-c 
but for three counter-propagating currents spaced 150 nm apart. g, Analysis of 
the B® peak of an incompressible strip. B2°(x) signal (blue) acquired along the 


line indicated in Fig. 3a for V.g=—10.54 V (a single vertical line from Fig. 2a) 
showing the v=—12 incompressible peak, along with a numerical fit (red). The 
fit uses the experimental values of Vi, Asor and the SOT diameter with a single 
fitting parameter of the total current inthe incompressible strip resulting in 

I =1,3 pA. Anincompressible strip of width Ax=50 nm was used for the fit. The 
mean value of B2°(x) was subtracted from the data. The asymmetry in B2°(x) 
away from the peak is caused by the presence of counterflowing 
nontopological currents /‘' of lower density in the adjacent compressible 


strips. 
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Extended Data Fig. 5| The origin of equilibrium currents in the acquire a drift velocity resulting ina non-zero J; inthe incompressible state. c, 
compressible and incompressible quantum Hall states. a, Semiclassical Inthe compressible regime the external in-plane electric field is screened by 
picture of cyclotron orbits of holes with mutually canceling neighbouring establishing a charge-density gradient, giving rise to ge flowing inthe 
currents, resulting in zero bulk current. b, Inthe presence of anin-plane opposite direction (cyan arrows). 


electric field £, (+ and — signs represent external charges) the cyclotron orbits 
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Extended Data Fig. 6 | Determination of the accuracy of the twist-angle 
measurement. a, Traces of —B3° versus V,,in device A (from Fig. 1f) acquired 
witha step size AV,,=4.7 mV and anr.m.s. V5, =15 mV. The positions of the Voz 
and Vie? peaks can be determined to an accuracy better than +AV,, (onestep 
size), corresponding toa relative 0 accuracy of 59=+0.0002°.b, Asina, taken 


from Supplementary Video Lat a pixel position (x, y) =(2.53 pm, 5.9 um) with 
step size AV,, = 40 mV and anr.m.s.Vi,=35 mV, resulting ina relative 6 accuracy 
of 69=+0.002° in the imaging mode. The larger B* signal and the broader /" 
peaks inb compared to aare due to larger Vj, excitation (see Methods section 
‘Measurement parameters’). 
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Extended Data Fig. 7 | Resolving the local quantum Hall states in flat and incompressible strips with sign determined by the sign of o,,, magnitude by the 
dispersive bands in device A. Global R,,. (purple, right axis) and local B3° Landau level energy gap and separation by the Landau level degeneracy (red 
(blue, left axis) measured at a point in the bulk of device A versus the electron bars). The dispersive bands are shaded in yellow and the signal in the flat bands 


density n, at B,=1.19 T. The sharp B® peaks reflect the/' current in is amplified six times for clarity. 


device A 


Extended Data Fig. 8 | Landau level tomography. a, Slices of the 3D dataset 
BY (x,y, Vog) along various planes for device A. The bright signals denote the 2D 
manifolds tracing the incompressible states. The black lines trace the N= —4 
incompressible manifold used to determine n,(x, y) and 0(x, y). Itseparates 
fourfold degenerate Landau levels belowit from an eightfold degenerate 
Landau level above it (wide dark blue band). The region in the centre of the 
sample that shows no Landau levels corresponds to the grey-blue areain Fig. 3b 
where no MATBG physics is resolved. b, Representative horizontal slices of the 


data from Supplementary Video 1 showing the evolution of the Landau levels 
with V,,.c, Asin a, for device B. For the range of gate voltages shown, ¢; lies in 
the p dispersive band for the entire sample. The black lines show an example of 
a trace of the incompressible manifold lying above an eightfold degenerate 
Landau level. d, Representative horizontal slices of the data from 
Supplementary Video 3. Aninteractive interface for tomographic visualization 
of the datais available at ref. *°. 
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Extended Data Fig. 9 | Histogram of the charge disorder in device B. Histogram of 5n,(r) data from Fig. 3h along with a Gaussian fit (black) with a standard 
deviation Any=2.59 x10" cm”. 
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Extended Data Fig. 10| Landau level crossings of the dispersive bands. 

a, Numerically calculated Landau level energies as a function of magnetic field 
for afixed @=1.05°. An example level crossing is highlighted in red. 

b, Numerically calculated Landau level energies as a function of 6 for a fixed 


B,=1.22T. Anexample level crossing is highlighted in red. c-e, The B, =O 
bandstructure of bilayer graphene for 6 =1.05° (c), 9=1.16° (d) and 9=1.27° (e). 
The blue and red lines indicate the bands that arise from the positive and 
negative valleys, respectively. 
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Thermoelectric generation using the anomalous Nernst effect (ANE) has great 
potential for application in energy harvesting technology because the transverse 
geometry of the Nernst effect should enable efficient, large-area and flexible coverage 
of a heat source. For such applications to be viable, substantial improvements will be 
necessary not only for their performance but also for the associated material costs, 
safety and stability. In terms of the electronic structure, the anomalous Nernst effect 
(ANE) originates from the Berry curvature of the conduction electrons near the Fermi 
energy’”. To design a large Berry curvature, several approaches have been considered 
using nodal points and lines in momentum space*”’. Here we performa 
high-throughput computational search and find that 25 percent doping of aluminium 
and gallium in alpha iron, a naturally abundant and low-cost element, dramatically 
enhances the ANE by a factor of more than ten, reaching about 4 and 6 microvolts per 


kelvin at room temperature, respectively, close to the highest value reported so far. 
The comparison between experiment and theory indicates that the Fermi energy 
tuning to the nodal web—a flat band structure made of interconnected nodal lines—is 
the key for the strong enhancement in the transverse thermoelectric coefficient, 
reaching a value of about 5 amperes per kelvin per metre with a logarithmic 
temperature dependence. We have also succeeded in fabricating thin films that 
exhibit a large ANE at zero field, which could be suitable for designing low-cost, 


flexible microelectronic thermoelectric generators 


11-13 


Thermoelectricity, the conversion of heat current into electric 
energy, provides a key technology for versatile energy harvesting 
and for heat flow sensors. There is a rapidly growing demand for novel 
energy-harvesting technology to power Internet of Things sensors and 
wearable devices, particularly in the form of flexible, durable micro- 
thermoelectric generators (u-TEGs). So far, the technology has relied 
on the longitudinal thermoelectric response known as the Seebeck 
effect" ®. Recently, its transverse counterpart in ferromagnets, the 
anomalous Nernst effect (ANE), has gained increasing attention, as it 
has anumber of potential benefits® *’°’. For example, the transverse 
geometry of the Nernst effect enables efficient, large-area and flexible 
coverage of acurved heat source (Fig. 1a, left). It substantially reduces 
the number of production processes as well as the contact resistance 
of athermopile, compared with a conventional thermoelectric device 
(Fig. 1a, right). In addition, the transverse geometry is hypothetically 
better suited for thermoelectric conversion as the Ettingshausen heat 
current should support the Nernst voltage, whereas the Peltier heat 
current may suppress the Seebeck voltage’®. However, the ANE is too 
small compared with the Seebeck effect for any thermoelectric appli- 
cation. Thus, it is important to develop an approach to design a new 


class of materials that exhibit a large ANE at zero field. Moreover, as 
the use of rare and toxic elements would pose a barrier for applica- 
tions, low-cost and safe elements should be used for thermoelectric 
materials. 

Here we introduce two iron-based cubic compounds Fe,X (where 
X is Ga or Al) as materials suitable for designing such low-cost, flex- 
ible p-TEGs, in particular by using their thin-film forms. As detailed 
in Methods, our successful fabrication of their thin films enables us 
to obtain a large ANE of about 4 pV K ‘for Fe,Ga and about 2 pV K“ for 
FeAl at room temperature using an in-plane temperature gradient 
(Extended Data Fig. 3a). In addition, our films of Fe,Ga and Fe,Al have 
in-plane magnetization with a coercivity B, of about 40 Oe and 20 Oe, 
respectively, and exhibit a spontaneous ANE at zero field (Fig. 1c) for 
the out-of-plane temperature gradient. This enables us to design a 
much simpler p-TEG than the conventional Seebeck analogues (Fig. la, 
Extended Data Fig. 4)!°. Moreover, the specific power generation capac- 
ity [p= Pyax/(A(AT)) is estimated to be of the same order as or potentially 
greater than that of conventional )-TEGs (Methods), where P,,,,,A and 
ATare the maximum power, the area of the thermopile device and the 
temperature difference, respectively” ™. In the following, we discuss 
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Fig. 1| Transverse thermoelectric conversion, iron compounds and a nodal 
web. a, Thermopiles based on the ANE (left) and the Seebeck effect (right) have 
the transverse (VL AT) and longitudinal (V|| AT) geometries between the 
voltage (V) and temperature difference (A7), respectively. Compared with the 
pillar structure using p- and n-type semiconductors for the conventional 
thermoelectrics (right), the thermopile structure using the ANE may beas 
simple as, for example, a film type (yellow) deposited ona flexible substrate 
(left)!°. M, magnetization. b, Crystal structures of Fe,X (where X is Ga or Al) 
(left) and a-Fe (right). c, Magnetic field dependence of the Nernst voltage -V,, 
for the thin films of Fe,Ga, Fe,Al and a-Fe. As -V,, depends on the sample 


how we discovered these materials, their detailed physical properties 
using bulk single crystals and the origin of the large ANE. 

Recent developments in the understanding of topological aspects 
in electronic transport led to the discovery of the large anomalous 
Hall and Nernst effects in spin liquids and antiferromagnetic met- 
als}?.10.1719-22 It also fostered the finding that designing the electronic 
structure to locate the nodal points called Weyl points near the Fermi 
energy is crucial to enhance the Berry curvature and thus for ferro- 
magnets to exhibit a large ANE beyond the conventional linear scaling 
law with magnetization® °. The most popular ferromagnet is elemental 
alphairon (a-Fe), which adopts a body-centred cubic (bcc) structure. 
Although its natural abundance is the highest among the magnetic 
elements, it does not exhibit a large ANE by itself” (Fig. 1c, Extended 
Data Fig. 3a). It would be useful for applications such as versatile 
energy harvesting and heat flow sensors if we could find a large ANE 
iniron-based compounds. To find candidate compounds efficiently, 
we used a high-throughput computational method to screen materials 
without synthesizing them first. 

The inverse band-structure problem, namely identifying a material 
with given electronic properties, has a 20-year history”®. Recently, trig- 
gered by the combination of advanced ab initio calculation methods 
and intelligent data-mining techniques for huge databases, in silico 
design has made great progress in a variety of fields in materials sci- 
ence”*”’, However, high-throughput materials design for transport 
properties suchas the ANE still remains a major challenge, as accurate 
calculation of not only one-particle quantities (such as the Kohn-Sham 
energy) but also two-particle quantities (such as the response function) 
is required. In addition, for the ANE, we need to calculate the Berry 
curvature using a sufficiently dense k-mesh in the three-dimensional 
(3D) Brillouin zone (BZ). 

To reduce the formidable numerical cost for materials screening, we 
recently developed a code (a new version of pw2wannier in Quantum 
Espresso; Methods) that enables us to perform an efficient Wannier 
interpolation for calculations with relativistic ultrasoft pseudopo- 
tentials. We then performed a calculation for roughly 1,400 magnets 
available in the Materials Project” (Methods, Supplementary Table 1) 
to provide acomprehensive set of calculations for the transport prop- 
erties of magnets. Among more than 250 iron-based compounds, 
we focused on binary compounds from the perspective of future 
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geometry and measurement conditions suchas AT, we performed the 
measurements for all films with the same conditions (thickness, measurement 
set-up and so on). The temperature gradient in the sample (MgO substrate/ 
Fe,Ga, Fe;Al or a-Fe/MgO), V7, is monitored to be 3.0 Kmm”~. The temperature 
gradient in the Fe,Ga (Fe,Al) layer is estimated to be V7= 0.86 Kmm? 

(1.0 K mm) (Methods). d, Energy dispersion of the conduction band (red) and 
the valence band (blue) around the L point on the BZ boundary for Fe,Ga. The 
nodal web is made of nodal lines, shown by the yellow lines (the same asin 

Fig. 4d), along which the flat portions of the conduction and the valence bands 
toucharoundtheL point. 


applications. In Table 1, we list the five most promising candidates 
that would potentially exhibit a large ANE, with a transverse thermo- 
electric coefficient a,, of more than 2.4 A Km". Here we select Fe;Ga 
as a target material (Fig. 1b, Methods, Extended Data Figs. 1, 2), owing 
toits large |a,,| and toits high Curie temperature 7. ~ 720K (Table 1)” ®. 
In addition, its sister compound Fe,Al (7. = 600 K) is also of great inter- 
est fundamentally and technologically, as aluminium and iron are the 
most abundant metal elements in Earth’s crust. Furthermore, as both 
compounds have the same cubic structure (Fig. 1b), we find all the 
properties to be nearly isotropic, which is convenient for application 
(Supplementary Information, Extended Data Figs. 6, 8, 9). 

First, we present our experimental observation of the giant ANE 
in Fe,Ga and Fe,Al at room temperature. Figure 2a shows that the 
field dependence of the anomalous Nernst coefficient, S,,, for Fe;Ga 
and Fe, Al saturates to a value as high as around 6 pV K ‘and around 
4uV K+, respectively, comparable to the value reported for Co,MnGa 
(ref. ’). This is striking as it indicates that 25% gallium doping leads to 
20-fold enhancement in the ANE, compared with about 0.3 pV K" for 
a-Fe” (Fig. 2a), which shares the same bcc structure as Fe,Ga and Fe, Al 
(Fig. 1b). In addition, the Hall resistivity saturates to about 3 LOcm, one 
order magnitude larger than about 0.2 1.0 cm for a-Fe”’ (Fig. 2b). As 
shown in Fig. 2c, -S,,(7) exhibits a 7-linear increase at low temperatures 
and saturation on heating above room temperature. The Hall resistivity 
also exhibits a systematic increase on heating for the entire range of 
temperatures (Fig. 2d). 

In general, the anomalous Nernst coefficient is composed of two 
terms, thatis, S,,=QPyy- FxPyySxx Where p,, and o,, are the longitudinal 
resistivity and Hall conductivity, respectively (Supplementary Infor- 
mation, Extended Data Figs. 6, 7)'°. Namely, the transverse charge flow 
comes from the transverse thermoelectric current a,,(-V 7) and the 
Hall current. Figure 3a shows the temperature dependence of -a,,(7). 
Fe,Ga and Fe,Al share a common feature: a nearly isotropic broad peak 
following the 71n7 form, as we will discuss. It is also notable that —a,,(7) 
in Fe,Ga peaks at around 200K, reaching around 5.2AK‘m_“, thelargest 
value for any magnetic compound. By contrast, the Hall conductivity is 
nearly Tindependent for both compounds, ranging between 350 and 
600 O'cm' (Fig. 3b), the size of which is consistent with the scaling 
law with the longitudinal conductivity o,,.for the intrinsic mechanism 
of the anomalous Hall effect (Supplementary Information, Extended 
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Fig. 2|Large anomalous Nernst and Hall effects of Fe;X. a, b, Magnetic field B 
dependence of the ANE -S,, (a) and the anomalous Hall effect p,, (b) measured 
at room temperature. The magnitude of Bin the horizontal axis is corrected for 
the demagnetization effect. The data for a-Fe are also plotted for comparison”. 
c,d, Temperature dependence of -S,,.(c) and p,, (d) measured under a magnetic 
field of 2 T along [100]. B, Qand I refer to the directions of the magnetic field, 
heat and electric currents, respectively. 


Data Table 1)’. Asa result, S,, is dominated by the component a,,(-V7) 
for Fe,Ga, as shown in Extended Data Fig. 7. 


. 2 KaT ( 00 
In ferromagnets, the Mott relation a,, = - = [ ~ 


a ia can (where kz, 
e, EandF, are the Boltzmann constant, elementary charge, energy and 
Fermi energy of the electrons, respectively) normally holds at T« T, 
and thus a,, exhibits the Fermi liquid 7-linear dependence. In fact, 
-a,,/T shows a nearly 7-constant behaviour at low temperatures for 
both Fe,Gaand Fe,Al (Fig. 3a, inset). To further examine the 7 depend- 
ence, in Fig. 3c we plot g(7/Thy) = (Qx/Qyx( Tin) (Tn/T) Versus T/T), where 
gis anormalized transverse thermoelectric coefficient and T,, refers 
to the temperature where |a,,| attains its maximum (Extended Data 
Table 1). Interestingly, g for both Fe,Ga and Fe,Al shows a In7 depend- 
ence up to the highest temperature of our measurements, and shows 
saturation at large values below 7/T,, = 0.5. Similarly, we also plot 
0,,/Oy( Tm) for both compounds versus 7/7, and find that they roughly 
scale at 7/T,, > 0.5 (Fig. 3d). 

Using the Berry curvature expression for intrinsic o,,at 7=0 given by 
Oy (Ef) == (e?/A)Ynk On AK)O(Es - En,,), the Mott relation can be re- 


: 2 kT e? P 
written as ay. = "STF Dink On,2(k)6(E; - &;,), where the summation 


is taken for all the occupied states, O,, ,(k) isthe Berry curvature along 
the Z direction, nis the band index, O(¢) is the unit step function, 6(e) 
is the Dirac delta function and h, k and €,,,, are Planck’s constant, 
the wavevector and the energy of the electrons in band n, respectively. 
Thus, it is essential to find the mechanism to simultaneously enhance 
the density of states and Berry curvature at the Fermi surfaces to obtain 
a large —a,,/T as observed in Fe,Ga and Fe,Al. Using the Kubo formula, 


{n | vy | n’<n’ | vy | n) 


the Berry curvature can be described asQ,, =i Lwen 


(€n- En)” 
Here, |n) and ¢, are the cell-periodic Bloch eigenstates and eigenvalues, 
andv, and v, are velocity operators along directions y and v. The veloc- 


ity operator is defined as Oyy= ee, using the k-dependent 
‘H(v) 


Table 1 | The five most promising iron-based binary 
compounds obtained from the high-throughput 
computation 


Formula Space group Onax (A Km") T,(K) 
Fe,Pt Pm3m 6.2 450 
Fe,Ga Fm3m 3.0 720 
Fe,Al Fm3m 2.7 600 
Fe,Si Fm3m 2.5 840 
Fe,N Pm3m 24 760 


The compounds are sorted by the size of a,,,,, that is, the maximum value of a,, estimated at 
the Fermi energy at T < 500 K. T, is the experimentally obtained Curie temperature”?**. 


Hamiltonian H(k) =e “He where ris the position vector and His the 
original Hamiltonian. This equation indicates that the Berry curvature 
would be strongly enhanced in the vicinity of the gapped line structure, 
which originally belongs to the nodal lines before introducing the 
spin-orbit coupling (SOC) (Fig. 1d). Moreover, the density of states 
(DOS) would be highly increased when the nodal lines are not only 
energy independent but also interconnected, forming a quasi-2D net- 
work. In the following, we show that such a ‘nodal web’ structure may 
arise when two nearly flat valence and conduction bands touch around 
ahighly symmetric point (Figs. 1d, 4b-d) and eventually lead toalarge 
Berry curvature on inclusion of SOC. 

Figure 4a shows the band dispersion calculated using density func- 
tional theory (DFT) along highly symmetric lines in the momentum 
space. The numerically obtained o,,, and 0o,,/0E exhibit a clear kink and 
an associated peak around 60 meV, respectively (Extended Data Fig. 5a, 
b). Correspondingly, we find that the temperature dependence of a,,/T 
below about 200 K depends strongly on the energy £ around 60 meV. 
In particular, a,,/7 exhibits the logarithmic temperature dependence 
over a wide T range at 62 meV (Extended Data Fig. 5c). Notably, using 
their own T,, (Extended Data Table 1), experiment and theory are in 
agreement when plotting g= (@),/Qy.(Tm))(Tn/T) versus T/T, (Fig. 3c). 
This indicates that the experimental results can be well reproduced by 
assuming that F is located at about £, = +74 meV for Fe;Ga and about 
E,=+86 meV for Fe,Al. Moreover, the reasonable agreement can be 
also found in the temperature dependence of the Hall conductivity 
by using the same set of parameters (Fig. 3d). In addition, theory indi- 
cates that the dominant band is made of minority-spin states and thus 
the upward shifts in F with respect to F, can be naturally understood 
given that the magnetization is slightly overestimated in theory (Sup- 
plementary Information). 

Figure 4b shows the 3D network made of the nodal lines calculated 
for Fe,Ga without introducing SOC. All the nodal lines go through the 
whole BZ and are interconnected with each other, instead of forming 
aclosed loop inside the BZ. In particular, we highlight the nodal-line 
network witha nearly flat energy dispersion at F~ +74 meV as the nodal 
web (red and blue in Fig. 4b, c, yellow lines in Fig. 4d) around the L point. 
As two mirror eigenstates should be degenerate within each mirror 
plane (green shaded planes in Fig. 4d), the symmetry-protected six 
nodal lines branch off from another nodal line connecting the F and 
L points (Fig. 4c). 

After introducing SOC, a gap opens along the nodal lines and induces 
the Berry curvature. We expect that the Berry curvature becomes sub- 
stantially enhanced near the momenta that originally belong to the flat 
nodal web, suchas the one around the L point. This is because along the 
nodal line, the gap between the two bands forming the nodal web should 
be small (Supplementary Information, Extended Data Fig. 5d-f) and, 
moreover, the flat energy dispersion must enhance the DOS associated 
with the Berry curvature. 

To confirm this, we show the contour plot of the Berry curvature in 
Fig. 4e. The strongly enhanced Berry curvature is particularly seen at 
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Fig. 3 | Giant transverse thermoelectric conductivity for Fe,X. 

a, b, Temperature dependence of the transverse thermoelectric conductivity 
-a,, (a) and the Hall conductivity —o,, (b) obtained for various directions of B. 
Inb, we define the data symbols for the combination of Q (I) and B directions 
for aandb. Both-a,, and -o,, exhibit nearly isotropic behaviours. The inset ina 
shows ~a,,/T versus T.c, d, Scaling relations of —a,, (c) and o,, (d) versus 7/T,, for 
M|| [100]. Here, Mand 7,, refer to the magnetization and temperature at which 
|a,,| attains its maximum (Extended Data Table 1). The solid lines are obtained 
from first-principles calculations at = +74 meV (orange) and F=+86 meV 
(green). Theinsetinc shows the -In|E - £,| dependence (black solid line) of 

ee =- qa withq=4 ts calculated in the vicinity of F,=+62 meV above the 
Fermi energy. The vertical broken lines indicate the characteristic energies 
E=74 meV (orange) and F= 86 meV (green). Nearly isotropic character is found 
for M|| [100], [110] and [111] (Supplementary Information, Extended Data Fig. 9). 


the momenta connecting the edge of the nodal web around the LL point 
onthe BZ boundary, extending over a quasi-2D area spanned by the web. 
This situation could be comparable to the case of the band edge of the 
1D system being accompanied by a logarithmic increase of the DOS as 
a function of energy. Similarly, d0,,/0E exhibits a logarithmic energy 
dependence near F = 62 meV, indicating a divergent behaviour of the 
Berry curvature (Fig. 3c, inset). This leads to the -In7 dependence in 
a,,/T found at 62 meV (Fig. 3c, Extended Data Fig. 5), the breakdown of 
the Mott relation. Such -In7 dependence has been previously discussed 
for the Lifshitz quantum critical point between typeI and type Il Weyl 
fermions found for Co,MnGa (ref. ’). Our scenario indicates that even 
without having Weyl points nearby, the formation of a flat nodal web 
may lead to the logarithmic enhancement of the transverse thermo- 
electric conductivity a,,. 

Our discovery of the large spontaneous transverse thermoelectric 
effects indicates that two iron-based compounds, Fe,Ga and Fe, Al, 
should be suitable for designing low-cost, flexible p-TEGs by using their 
thin-film form. To gain a stable performance in daily use, itis important 
to enhance the coercivity further. Finally, it would be also an inter- 
esting future direction to look for an enhanced output by combining 
the ANE with the spin Seebeck effect, both of which occur inthe same 
transverse geometry** *”. 

Note added in proof: During the review process of this paper, we 
became aware of a work by Nakayama etal.*®, the received date of which 
is one week later than ours. While they observed an enhancement in 
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Fig. 4| Evidence for the nodal web structure. a, Band structure around the 
Fermi energy. Red (blue) lines correspond to up (down) spin bands computed 
without SOC. Dashed lines show the bands calculated including SOC. 

b, BZ showing the nodal-line network. The nodal web formed around theL point 
is highlighted by the colour, which shows the energy dispersion of the nodal 
lines ona linear scale in a narrow range between 0.04 eV (blue) and 0.08 eV 
(green) above the Fermi energy. c, Three-dimensional network forming the 
nodal web around the LL point. The kww., kyy and k,;, axes indicate the directions 
parallel to W-W’, K-U and I-L, respectively. The nodal line (green) connecting 
thel andL points branches off into three nodal lines (red-blue) at the L point. 
d, Nearly flat energy dispersion of the nodal lines (yellow) forming the web 
structure around the L point, on the plane perpendicular toI[-L. The kyw-and 
kyy axes are the same as those inc and the zaxis indicates the energy. A 
higher-energy (red) anda lower-energy (light blue) band touches with each 
other sharing gapless nodal lines (yellow lines, the same as in Fig. 1c) along the 
mirror planes (shaded green planes) in the case without SOC. e, Contour 

plot of the Berry curvature Q,, , of the lower-energy band n (ind) around the 
Lpoint and nodal lines. The z direction is along [001] parallel to the 
magnetization. The Berry curvature was integrated within [-Ak, Ak] along the 
kr axis; On (kww ky) =f dk Qn, 2 (Kwwe Kyu, kp_)- 4k was chosen to be 
sufficient to include the nodal web structure. The Berry curvature is strongly 
enhanced inthe extended region around the nodal web structure (yellow lines) 
(Methods, Extended Data Fig. 5d-f). 


the ANE by gallium doping in bcc-Fe in its thin-film form, the materi- 
als used in their study are in a disordered phase, not the ordered DO, 
phase as studied in this work. Moreover, the maximum of|S,,|in Nakay- 
amaetal.*’ is less than half of the values found in our experiment and 
does not appear at x = 0.25 (that is, Fe,Ga). As a result, the transverse 
thermoelectric conductivity a,, is about five times smaller than our 
estimate, highlighting the importance of both the ordered DO, crys- 
tal structure and the stoichiometry of Fe,Ga. Our work also contains 
the experimental results using not only thin films but also bulk single 
crystals over a wide temperature range with different field orientations. 
The combination of our comprehensive experimental observations 
and theoretical analysis have revealed that a ‘nodal web’ structure in 
the vicinity of the Fermi energy plays an important role in enhancing 


the Berry curvature and thereby the transverse thermoelectric effect. 
In addition to the in-plane heat current configuration, we performed 
ANE measurements with the heat current applied perpendicular to the 
thin film, which reveals the zero-field spontaneous anomalous Nernst 
voltage with a coercivity of about 40 Oe in Fe,Ga. 
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Methods 


Single-crystal growth and experimental methods 
Single crystals of Fe;X (where X is Ga or Al) were prepared by the 
Czochralski method after making polycrystalline samples by arc melt- 
ing. As-grown single crystals were used for all the measurements. Our 
analyses using both inductively coupled plasma spectroscopy and 
energy dispersive X-ray analysis indicate that our single crystals are 
stoichiometric within a few percent resolution. The samples were ori- 
ented by the Laue backscattering method and subsequently cut by spark 
erosion. The bar-shaped sample with flat polished surfaces was used 
for all the transport measurements, including the electric resistivity 
and Hall, Seebeck and Nernst effects ina commercial system (thermal 
transport option (TTO) of physical property measurement system 
(PPMS), Quantum Design). A schematic of the measurement setup is 
provided in Extended Data Fig. 3b. The three gold-plated copper leads 
were attached to the sides of arectangular parallelepiped sample with 
silver epoxy, which were connected to one resistive heater and two 
(hot and cold) Cernox1050 thermometers by screws. Two copper wires 
were also attached on the side surfaces of the sample with silver epoxy, 
which were used to measure the Hall and Nernst voltages. One of the 
edges of the sample was directly clamped to the heat sink of the TTO 
puck by the screw. The magnetic field B was applied along [100], [110] 
or [111]. The electric (thermal) current I (Q) was applied perpendicular 
to B, namely I (Q) || [001] or [011] for B || [100], [001] or [110] for 
B|| [110], and [112] for 1 (Q) || [111] without a transverse component (adi- 
abatic Nernst effect). All the transport properties represent steady-state 
values, which were taken under a constant heat/electric current. Mag- 
netization was measured by using acommercial SQUID magnetometer 
(MPMS, Quantum Design) after reshaping the single crystal into a tiny 
piece (about 0.1 mg). During the measurement, the magnetic field was 
fixed at each measurement point in the persistent mode. 

The anisotropy in the Nernst and Hall effects and magnetization 
was found to be small, as we discuss in Supplementary Information 
(Extended Data Figs. 6, 8, 9). 


Crystal structure 

Fe,X has a bcc derivative structure, namely, an ordered DO, structure 
(Fm3m), partially ordered B, structure (Pm3m), which corresponds to 
stoichiometric Fe,X phase, and fully disordered A, structure (/m3m). 
Our single crystals were confirmed to have a DO, structure by X-ray 
diffraction (XRD) (Extended Data Fig. 1a, b) and selected area electron 
diffraction (Extended Data Fig. 1c). The XRD pattern is well reproduced 
by the Rietveld analysis as shown by the solid line in Extended Data 
Fig. 1a, b, indicating the single phase of the DO, Fe,X with a lattice con- 
stant of a=5.80 A for Fe,Gaand a=5.79 A for FeAl. For Fe,Al, (111) peak 
is clearly observed around 26 = 27° (where 26 is the angle between 
incident and diffracted X-ray beam directions), indicating the DO, 
ordered structure. For Fe,Ga, the (111) peak is expected to be very small 
(about 0.8% of the (220) peak) and cannot be observed in our experi- 
ment. Nevertheless, we detect the signal from (111) by the electron 
diffraction as shown in Extended Data Fig. 1c. While the (111) and (200) 
reflections are detected by acharge-coupled device camera for Fe, Al, 
they are detected by only an imaging plate with long exposure time for 
Fe,Ga. Note that the effect of multiple reflection and diffraction from 
the oxide appears due to the high sensitivity of the imaging plate and 
long exposure time. For the powder XRD measurements, we annealed 
the sample at 630 °C for about 10 h for Fe,Ga, and 400 °C for about 
20h for Fe,Al after making powder from single crystals to remove the 
strains and distortions introduced by the crushing procedure. 


Fabrication and characterization for Fe,Al and Fe,Ga thin films 

Fe,Al films were fabricated on MgO(001) substrates by a molecular 
beam epitaxy method under ultrahigh vacuum (UHV). The base pres- 
sure of the UHV chamber was below 2 x 108 Pa. Before the deposition, 


the substrates were annealed at 800 °C for 10 min inthe UHV chamber. 
An MgOlayer (5nm) was deposited ata rate of 0.1As ‘at room tempera- 
ture onthe MgO substrate as a seed layer for the Fe,Al layer. The Fe,Al 
layer (50 nm) was deposited at a rate of 0.2 As at room temperature 
and subsequently it was annealed at 450 °C for 30 min. 

Fe,Ga films were fabricated on MgO(001) substrates by a d.c. mag- 
netron sputtering method. The base pressure of the chamber was 
below5 x10” Pa. Before the deposition, the substrates were annealed 
at 800 °C for 10 minin the chamber. An Fe,Ga layer (50 nm) was depos- 
ited at arate of 0.4 As at room temperature and subsequently it was 
annealed at 500 °C for 30 min. 

The compositions of the FeAl and Fe,Ga layers were estimated 
to be Fe, ,Al,, and Fe, »,Gag3, respectively, by scanning electron 
microscopy-energy dispersive X-ray spectroscopy. To prevent oxida- 
tion, the Fe,Al and Fe,Ga layers were both capped with MgO (5 nm) 
under the same conditions as described for the seed layer. 

The structural properties of the Fe,Al and Fe,Ga films were analysed 
by XRD measurements using a Cu-Ka source. Extended Data Fig. 2a 
shows the XRD patterns (20/w-scan) of the Fe,Al and Fe,Ga films made 
by the solid phase epitaxy. We confirmed the (002) and (004) peaks 
in the Fe,Al and Fe,Ga films indexed by the Bravais lattice of the bcc 
structure with the lattice constants of c=5.71A and c=5.74 A, respec- 
tively. Extended Data Fig. 2b presents the XRD patterns (@-scan) of 
the {202} planes of the Fe,Al and Fe,Ga films, and the MgO substrate. 
The reflections of both Fe,Al and Fe,Ga films show fourfold symme- 
try with 90° intervals. These XRD patterns indicate that these layers 
are grown epitaxially with a well-defined in-plane orientation. As the 
peaks of Fe,X {202} are shifted by 45° with respect to the corresponding 
{202} peaks of MgO, the crystallographic relationship is determined to 
be Fe;X(001)[110]||MgO(001)[100]. The presence of the superlattice 
(111) peak was further examined to determine whether the films are 
chemically ordered in the DO, type. Extended Data Fig. 2c shows the 
(111) peaks in the XRD pattern (26/w-scan) for the Fe,Al and Fe,Ga thin 
films, indicating the DO, structure. 


Magneto-thermoelectric transport properties of Fe,Al and 
Fe,Ga thin films 
To estimate the anomalous Nernst coefficient S,, of the Fe,Ga and Fe,Al 
thin films, we performed the Nernst voltage measurements with the 
temperature gradient V 7 along both the in-plane and the out-of-plane 
directions. For the measurements using the former ‘in-plane’ V Tcon- 
figuration, we employed the same apparatus (TTO of PPMS) and the 
same temperature gradient configuration as used to measure the bulk 
Fe,Ga and Fe,Al samples (see ‘Single-crystal growth and experimental 
methods’ and Extended Data Fig. 3b). Note that V7 in the thin film is 
determined by the V7 inthe substrate for the ‘in-plane’ V 7 configura- 
tion, as the thermal resistance of the sample (MgO substrate (500 pm)/ 
Fe,X (SO nm)/MgO capping layer (5 nm)) is dominated by the MgO 
substrate as follows. The total thermal resistance 6,,.q is described as 
1/6 cotat = 1/O substrate + V/Or6,x + 1/8 cap, Where Osubstrater OFe x and 6.ap are the 
thermal resistance of the MgO substrate including the seed layer, Fe,X 
(50 nm) and MgO capping layer, respectively. Each thermal resistance 
can be estimated by using the sample dimensions as described in 
Extended Data Fig. 3b as well as the thermal conductivity of MgO and 
Fe3X, Kygo ~ 60 W K* m™ and k..x = 20 W K* m" (refs. **°). As 
A.uvstrare* 150 K Wis much smaller than both 6, x 4.5 x 10° K W ‘and 
Beap * 1.5 x10’ K W*, 1/6;.,x and 1/6... are negligibly small and thus we 
donot need to consider any contributions other than the one fromthe 
MgO substrate to estimate the in-plane temperature gradient of the 
sample. Independently, we carried out the thermal conductivity meas- 
urement of the same sample and confirmed that the thermal conduc- 
tivity is estimated to be56 WK ‘m4, whichis consistent with the above 
estimate based on literature”. 

Extended Data Fig. 3a shows the field dependence of the anomalous 
Nernst coefficient for the in-plane temperature gradient V7 (heat 


current Q || [110]) and the perpendicular magnetic field (B || [001]) at 
300 K. The Nernst coefficient is estimated to be |S,,|=4.0 pV K‘, 
1.8 pV K ‘and 0.3 pV K “for the Fe,Ga, Fe,Al and a-Fe (50 nm) thin films, 
respectively, consistent with the bulk results. Note that the a-Fe (50 nm) 
film was epitaxially grown by the molecular beam epitaxy method on 
the MgO(001) substrate with the same in-plane orientation as the Fe,X 
films, and the linear-in-B ordinary Nernst effect N ~-0.05 pV KT 7 is 
subtracted for the a-Fe sample. 

In addition to the abovementioned ‘in-plane’ V7 and out-of-plane 
B configuration, we performed the Nernst effect measurements with 
‘out-of-plane’ V 7 and in-plane B, which is a more practical configura- 
tion for the magneto-thermoelectric devices. 

The films used for the ‘out-of-plane’ V7 measurements are sand- 
wiched from top to bottom, in the following order, as illustrated in 
Extended Data Fig. 3c: [(1) Resistive heater] + [(2) Copper heat sink] + 
[(3) Thermal interface silicone soft pad (TIP) (SOO pm)]+[(4) MgO cap- 
ping layer (5nm)]+[(5) Sample: Fe,Ga, Fe,Al or a-Fe (50 nm)]+[(6) MgO 
substrate (500 pm)]+[(7) TIP (SOO pm)]+[(8) Copper heat sink]. Here 
we used a thermal interface silicone pad TC-SOHSV-1.4 (thermal con- 
ductivity, «~1.3 WK7m‘; thermal resistance, about 1.1K W‘; Shin-Etsu 
Silicone) for both TIP layers. In the measurements, the magnetic field 
and heat current were respectively applied to [110] and [001] directions 
of each film. As the stacking structure is complicated, it is important 
to make further confirmations of the Nernst effects using several inde- 
pendent methods to estimate the temperature gradient/difference. 

Figure Ic shows the field dependence of the Nernst voltage of the 
Fe,X and a-Fe thin films at room temperature. A7,, is monitored to be 
1.5K inallthe samples by two calibrated thermocouples placed onthe 
top and bottom surfaces of the samples (Extended Data Fig. 3c). The 
Fe,Ga (FeAl) sample shows a sharp hysteresis with a zero-field value 
of 19.8 pV (10.6 pV) and coercivity of 40 Oe (20 Oe). These signals are 
much larger than those obtained for a-Fe. Anisotropic character of the 
anomalous Nernst coefficient, as confirmed inthe bulk Fe,X measure- 
ments (Supplementary Information, Extended Data Fig. 8), allows us 
to adopt the in-plane Nernst coefficient (Extended Data Fig. 3a) to 
estimate the applied out-of-plane temperature difference. The tem- 
perature difference and temperature gradient of the Fe,Ga (Fe, Al) layer 
are estimated to be about 4.5 x 10° K and 0.9 K mm ‘(5.4 x 10° K and 
1.1K mm’), respectively. These values are consistent with the tem- 
perature difference and gradient estimated by another independent 
method, as we will discuss below. 

To estimate the temperature difference and gradient applied inthe 
Fe;X film (SO nm), the Nernst effect of an a-Fe epitaxial thin film (50 nm) 
was measured under the same conditions as in the case of the Fe,Al 
film. By assuming that the a-Fe film has the same anomalous Nernst 
coefficient S,,and thermal conductivity as those reported for the bulk 
O-Fe (S\xo-re = 0.3 WV K™ (ref. ”) and Ka.r¢=80 W Km" (ref. “)), the tem- 
perature gradient, temperature difference and heat flux of the a-Fe film 
(50 nm) can be estimated to be V 7,.-.= 0.19 K mm, AT,.-.=1.0 x 10° K 
and Q,.;.~ 0.21 W, respectively. 

As we use the same stacking sequence and temperature gradient 
(difference) for both a-Fe and Fe,X on MgO (that is, V 7,,=3.0 Kmm”, 
AT, =1.5K), the heat current of Fe,X is expected to be almost the same, 
that is, Qre,x ~ 0.21 W. By using the thermal conductivity k..¢ 
(Kre,a) =19.0 (17.4) WK 1m‘ of Fe;X presented in Extended Data Fig. 6 
(Supplementary Information), the temperature gradient and tem- 
perature difference applied in the Fe,Ga (Fe,Al) film can be estimated 
tobe V Tre cq =0.82Kmm ‘and A Tre ga= 4-110 °K (V Trea) = 0.89 Kmm* 
and ATy.,,1= 4-5 * 10 °K), respectively. These are nearly the same values 
as those calculated using the Nernst voltage (Fig. 1c) and the Nernst 
coefficients obtained in the ‘in-plane’ V T configuration (Extended Data 
Fig. 3a), indicating that our method to estimate the temperature gradi- 
ent is reasonable. Another way of confirming the validity for the 
‘out-of-plane’ V7 measurement is to examine the Nernst coefficients 
using the above estimates of the temperature difference and the Nernst 


voltage. The obtained Nernst coefficients are estimated to be 
Sy = 4.4 pV K “and 2.2 pV K” for the Fe;Ga and FeAl 50-nm thin films, 
respectively, which are nearly the same values as those found using 
the Nernst effect measurements in the ‘in-plane’ V7 configuration 
(Extended Data Fig. 3a). 


Estimation of the specific power generation capacity 

p= Pmay/(A(AT)’) 

In this section, we evaluate how much power can be obtained from 
the p-TEG using the anomalous Nernst effect of Fe,X. Extended Data 
Fig. 4 shows a schematic of the p-TEG based on the anomalous Nernst 
effect'*. The thermopile consists of an alternating array of the Fe,X 
(yellow) and electrode (gold) wires placed ona substrate. Two-hundred 
wires, whose thickness, width and length are designed to be t=1 um, 
w=20umand/=9 mm, respectively, are aligned in the substrate with 
the area of A=1cm/?. Owing to the lateral configuration of the anoma- 
lous Nernst effect, atemperature difference ATis applied perpendicu- 
lar to the plane and the resultant voltage is V/AT=LS,,/t = 2001S,,/t= 
10.8 V K™ for Fe,Ga (S,,~ 6 pV K") and 7.2 VK” for FeAl (S,,4 VK"), 
where is the total length of the material. Note that this ‘geometrically 
enhanced’ Nernst voltage is comparable to the p-TEG output for the 
25,000 p-n pairs of Bi,Te;. For practical applications, the main con- 
cern is not widely used efficiency or figure of merit Z7, but specific 
power generation capacity [=P a/(A(A7)), which indicates how much 
power is generated from a temperature difference AT (refs. "). 
[, for our p-TEG design is estimated to be /p = LwS,,’/(4tp)/(1.cm’) = 
324 LW (K? cm?) for Fe,;Ga and 144 pW (K? cm?) for Fe,Al. The values 
estimated here are the hypothetical upper limit. Although the actual 
values may be reduced by a factor of about ten in a practical p-TEG, 
they would be still comparable to or higher than those reported for 
conventional 1-TEGs to date” ®, 


High-throughput calculation of thermoelectric conductivity 
For the crystal structures in the Materials Project”®, the electronic struc- 
tures were calculated within the generalized gradient approximation 
using the Quantum Espresso code. The crystal structure was analysed 
using cif2cell* and ultrasoft pseudopotentials inthe PSLibrary** were 
used. The magnetic moment was assumed along the zaxis and the SOCs 
were included. Wannier interpolation was automatically performed by 
using the wannier90 code* with the appropriate initial orbitals and 
energy ranges; that is, the atomic orbitals whose energy level is higher 
than —-1 Rydberg (Ry) were selected as initial orbitals and the Kohn- 
Sham orbitals with the energy range below £, of +1 eV were fully used 
to construct the Wannier orbitals. The script used for this calculation 
is available at https://github.com/wannier-utils-dev/cif2qewan/. For 
each Wannier interpolated Hamiltonian, we confirmed that the energy 
difference between the interpolated band and the original DFT band 
is less than 0.2 eV. In terms of the computational cost, we limited the 
number of atoms in the unit cell to be less than eight atoms. With this 
criterion, the number of successfully obtained Wannier Hamiltonians 
is 1,419, including 265 iron-based compounds. The list of a,, calculated 
for all the 1,419 compounds are shown in Supplementary Table 1. 

For all these Wannier Hamiltonians, the intrinsic contribution of 
the anomalous Hall conductivity o,, and the anomalous transverse 
thermoelectric conductivity a,, were calculated through the Berry 
curvature formula as 


2 
O71, (T, H) i — Gyr nll (1) 


where e, h,¢,f, wand Q, , are the elementary charge with a negative 
sign, the reduced Planck constant, the band energy, the Fermi-Dirac 
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distribution function with the band index n and the wavevector k, 
the chemical potential and the z component of the Berry curvature, 
respectively. 

To select the candidates, we first sorted the list by the magnitude 
Of Qnax, Which refers to the maximum value of |a,,| estimated at the 
Fermi energy below 500 K. Then, we focused on the iron-based com- 
pounds as iron is the least expensive element. Finally, we removed 
paramagnets, antiferromagnets and the materials that do not exist in 
past experimental reports. 


DFT calculation 

The electronic structures of Fe;X were obtained by using the OpenMX 
code*®, where the exchange-correlation functional within the 
generalized-gradient approximation and norm-conserving pseudo- 
potentials” were employed. The spin-orbit interaction was included 
by using total angular momentum-dependent pseudopotentials**. 
The wave functions were expanded by a linear combination of multi- 
ple pseudoatomic orbitals”. A set of pseudoatomic orbital basis was 
specified as Fe6.0-s3p3d3, Al7.0-s3p3dl1 and Ga7.0-s3p3d3, where the 
number after each element stands for the radial cutoff in the unit of 
Bohr and the integer after s, p and d indicates the radial multiplic- 
ity of each angular momentum component. The cutoff energies for 
a charge density of 800.0 Ry and a k-point mesh of 35 x 35 x 35 were 
used. The lattice constants of Fe,Ga and Fe,Al were set to 5.80 A and 
5.79 A, respectively). The evaluated magnetic moments of Fe,Ga 
and Fe,Al are 6.69 Bohr magneton per formula unit (uu, f.u.’) and 
5.93 Us f.u., respectively, which are close to the experimentally 
obtained values and consistent with previously calculated results>??. 
The nodal lines were obtained by monitoring the degeneracy of 
eigenvalues in the momentum space based on the electronic 
structure without SOC. 


Wannier representation and transport properties 

From the Blochstates obtained inthe DFT calculation described above, 
a Wannier basis set was constructed by using the wannier90 code*. The 
basis was composed of d-character orbitals localized at each Fe site, 
(s,p)-character ones at the Al sites and p-character ones at the Gasites, 
that is, 38 orbitals per f.u. of Fe,Al and 36 orbitals per f.u. of Fe,Gain 
total, including the spin multiplicity. These sets were constructed from 
the Bloch functions in the energy window ranging from—15 eV to SO eV, 
which include 92 bands for Fe,Al and 110 bands for Fe,Ga. 

The anomalous Hall conductivity o,, and the anomalous transverse 
thermoelectric conductivity q,, that intrinsically appear in crystalline 
systems can be expressed with the out-of-plane component Q, .(k) 
of the Berry curvature as described in equations (1) and (2). 0,, and 
@,, Shown in Fig. 3c, d were calculated according to the above equa- 
tions in the Wannier representation. For the numerical integration, a 
k-point mesh of 100 x 100 x 100 and in addition an adaptive mesh of 
Nadp = 3 X 3 x 3. in regions with large Q,,, were used. This adaptive mesh 
is already sufficiently large as we confirmed that the relative change of 
-0,, calculated with N,g,=3 x 3 x 3 and 6 x 6 x 6 is less than 1%. 

Extended Data Fig. 5a, b shows the energy dependence of the Hall 
conductivity -o,,and the transverse thermoelectric conductivity esti- 
mated using the Mott relation, namely a,,/T=— q(00,,/0E)r_,, with 


m2 kh 


q=3 Extended Data Fig. 5c shows the temperature dependence of 
-a,,/T calculated using equation (2) for various energies. 


Data availability 


The data that support the plots within this paper and the other find- 
ings of this study are available from the corresponding author upon 
reasonable request. 
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Extended Data Fig. 1| Evidence for the DO, structure of Fe,X.a, b, XRD 
pattern for Fe,Ga (a) and Fe, Al (b) obtained by Cu-Ka radiation (A=1.5401 A)at 
room temperature. The solid circles and the solid line (red) represent the 
experimental results and the Rietveld refinement fit, respectively. The final 
weighted and expected R indicators and goodness-of-fit indicator S are 

Ryp= 2.34%, Re= 1.49% and S$ =1.55 for Fe;Ga and Rwp= 1.91%, R, = 1.27% and 


| 
60 70 80 90 100 
20 (degree) 


1 1/nm 


Fe, Al 


S=1.48 for Fe,Al, respectively. Vertical bars (green) below the curves indicate 
the major peak positions calculated for DO; Fe,;Ga and Fe,Al, which are more 
than 1% of the main peak. The lower curve (orange) represents the difference 
between the experimental result and the Rietveld refinement. /, intensity. 

c, Selected area electron diffraction pattern for our single crystals of Fe,Ga 
(left) and Fe;Al (right) taken from the [110] plane. 
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Extended Data Fig. 2 | Evidence for DO; structure of Fe3Ga and Fe;Al thin bottom. b, g-scan patterns of the {202} planes of the Fe;Ga and Fe;Al layers, and 
films. a, Room temperature spectra obtained by XRD 26/w-scans for the Fe,X the MgO substrate. c, 20/w-scan patterns for the (111) plane of the Fe,Ga and 
thin films onan MgO substrate and the MgO substrate itself. The theoretical Fe,Al thin-film layers. 
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Extended Data Fig. 3 | ANE of Fe;X and experimentalsetupforbothbulkand — gradient.b, Schematic of the experimental setup for the ANE measurement 
thin films. a, Magnetic field dependence of the ANE obtained for the Fe;X thin using the in-plane temperature gradient. c, Schematic of the experimental 
films (SO nm) and the a-Fe thin film (50 nm) using the in-plane temperature setup for the ANE measurement using the out-of-plane temperature gradient. 
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Extended Data Fig. 4 | Schematic of the p-TEG based onthe ANE. The substrate and these two wires are connected ina zigzag configuration. A 
thermopile consists of asquare-shaped substrate (black frame) and an temperature gradient is applied perpendicular to the plane. The thickness of 
alternating array of Fe;X (yellow) and gold wires (brown) placed on the the wire is designed to be1 pm. 
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Extended Data Fig. 6| Longitudinal electric and thermal transport 
properties and magnetization for Fe,X. a, b, Temperature dependence of the 
longitudinal resistivity p,.. for Fe,Ga (a) and Fe,Al (b). c,d, Temperature 
dependence of the Seebeck coefficient S,.. for Fe,Ga (c) and Fe,Al (d). 

e, f, temperature dependence of the thermal conductivity x,,. for Fe,Ga (e) and 
Fe,Al (f). #100, #110 and #111 represent the samples used for the transport 
measurements in B|| [100], [110] and [111], respectively. The inset in b shows the 
7 dependence of p,, for Fe,Ga #100 (red) and FeAl #100 (blue). The inset inc 
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shows the log-log plot of -S,.. versus 7 for Fe,Ga #100 (red) and Fe,Al #100 
(blue). The solid and broken lines represent the Tand 7'° dependence, 
respectively. The solid and broken lines in e and fshow the estimated electric 
and lattice contributions to the thermal conductivity (Supplementary 
Information). g,h, Magnetization curve for Fe,Ga (g) and Fe;Al (h) at T=5K 
under B || [100], [110] and [111]. The insets in g and hare the 7°? dependence 
of M(T). 
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Extended Data Fig. 7 | Specific heat and two contributions tothe 
anomalous Nernst effect. a, Temperature dependence of the specific heat 
divided by temperature C/T for Fe;X. The solid lines represent the fit by the 
combination of the electronic and Debye-type phonon specific heat and 


ferromagnetic magnon contribution (Supplementary Information). The inset 
shows the 7"? dependence of C/T at low temperatures. b, ¢, @,,p (b) and 
-Oy,PS,(€). The ANE is the sum of the two terms, thatis, S,,= Gy. — Oy,PSxx 
(Supplementary Information). 
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Extended Data Table 1| List of the scaling parameters and magnitude of o,, and o,, at T= O 


M He —AyTm) | ~ Oa Tmn) |o,,(T ~ 0) |o,,(T ~ 0) 
unit - K AK-*m~* On'cm-! O-'cm" | e4/(2ha) | Q'cm-" | e2/(2ha) 
[100] 200 5.2 610 
Fe,Ga [110] 210 5 570 ~600 ~0.9 ~31000 ~47 
[111] 200 5.1 550 
[100] 180 3.6 460 
FeAl [110] 180 3.8 460 ~370 ~0.55 ~20000 ~30 
[111] 180 3.3 460 
[100] 320 3.3 730 
74 meV | [110] 340 2.8 610 
[111] 340 a7 590 
[100] 370 3.3 670 
86 meV | [110] 370 2.8 560 
[111] 380 2.7 540 


The first two rows indicate the experimental values obtained for Fe,Ga and Fe,Al and the last three rows the theoretical values obtained for E = +74 and E = +86 meV. Each has three consecutive 
rows corresponding to the orientation of the magnetization. Here, T,,, is the temperature where the transverse thermoelectric conductivity |a,,| attains its maximum |a,,(T,,)|- 
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When miniaturizing fluidic circuitry, the solid walls of the fluid channels become 
increasingly important’ because they limit the flow rates achievable for a given 


pressure drop, and they are prone to fouling”. Approaches for reducing the wall 
interactions include hydrophobic coatings’, liquid-infused porous surfaces* ©, 
nanoparticle surfactant jamming’, changes to surface electronic structure’, 


electrowetting” 


, surface tension pinning 


1112 and use of atomically flat channels”. 


A better solution may be to avoid the solid walls altogether. Droplet microfluidics and 
sheath flow achieve this but require continuous flow of the central liquid and the 
surrounding liquid’"*. Here we demonstrate an approach in which aqueous liquid 
channels are surrounded by an immiscible magnetic liquid, both of which are 
stabilized by a quadrupolar magnetic field. This creates self-healing, non-clogging, 
anti-fouling and near-frictionless liquid-in-liquid fluidic channels. Manipulation of the 
field provides flow control, suchas valving, splitting, merging and pumping. The latter 
is achieved by moving permanent magnets that have no physical contact with the 
liquid channel. We show that this magnetostaltic pumping method can be used to 
transport whole human blood with very little damage due to shear forces. Haemolysis 
(rupture of blood cells) is reduced by an order of magnitude compared with 
traditional peristaltic pumping, in which blood is mechanically squeezed througha 
plastic tube. Our liquid-in-liquid approach provides new ways to transport delicate 
liquids, particularly when scaling channels down to the micrometre scale, with no 
need for high pressures, and could also be used for microfluidic circuitry. 


To avoid contact with the walls of a device, magnetic forces can be 
used to levitate particles or live cells ina paramagnetic liquid. Nearly 
wall-less microfluidic channels have been demonstrated in which con- 
tinuous ‘magnetic antitubes’ of water surrounded by an aqueous para- 
magnetic salt solution were stabilized” by using an electromagnet and 
irontracks. However, the antitube lifetime was limited by ion interdif- 
fusion between the two liquids, the salts were toxic, and contact with 
one stationary wall containing the track could not be avoided. Here 
we overcome these limitations, creating entirely wall-less microfluidic 
channels consisting of diamagnetic antitubes completely enclosed by 
immiscible, and in select cases, non-toxic, paramagnetic fluids. 

The key to wall-less magnetic confinementis an extended quadrupolar 
flux source, leading to a null magnetic field along a line at the centre 
(Fig. 1a, b). Appropriately magnetized Nd,Fe,,B bars are used to define 
tubular channels, or else custom-made magnet bilayers are waterjet-cut 
to define more complex fluidic circuitry. The magnetic bars are housed 
in 3D-printed supports with conventional microfluidic inlet and outlet 
ports (Fig. 1and Extended Data Fig. 1). The various magnetic fluids used 
are listed in Table 1; stronger confinement is achieved with ferrofluids, but 
optical transparency is possible with our paramagnetic oil, which we call 
‘Magoil’. Ferrofluids are colloidal liquids of magnetite (Fe;0,) nanoparti- 
cles suspended ina carrier fluid, whereas Magoil is arare-earth-based oil, 


inspired by the contrast agents based on diethylenetriaminepentaacetate 
(DTPA) that are used for magnetic resonance imaging” (Extended Data 
Fig. 2,and Methods). We used commercial ferrofluids (Ferrotec APG311 
and EMG900, 2 vol% and 13 vol% magnetite in hydrocarbon oils respec- 
tively; and Qfluidics MKC and MD4, 29 vol% magnetite in hydrocarbons 
and 5 vol% magnetite in perfluorodecalin, respectively). 

Aqueous antitubes were typically formed by injecting ferrofluid into 
water-filled quadrupole channels and were visualized by X-ray imaging 
and tomography (Fig. Ic, d, f, Extended Data Fig. 3c—e). Tomography 
images unambiguously reveal the wall-less features, confirmed by visual 
inspection of transmitted light through antitubes up to 1m long (Extended 
Data Fig. 1g). Optical imaging was found to be possible through ferrofluid 
no more than 200 pm thick and with a high-contrast camera (Extended 
Data Fig. 3f, g), where we can take advantage of the optical resolution to 
image the smallest antitube features. We were able to image antitubes 
in transparent Magoil with standard optical or fluorescent microscopy 
by adding contrast ink or fluorescent dye to the water antitube, allowing 
real-time visualization of antitube extrusion and retraction (Supple- 
mentary Video 1). Furthermore, the trapped gas bubbles that are often 
problematicin conventional devices can easily be removed, as their buoy- 
ancy in Magoil overcomes the magnetic confinement. For most practical 
applications, however, ferrofluids are preferred because of their much 
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Fig. 1| Wall-less magnetic confinement ina fluidic channel. a, Permanent 
magnets (red, blue) in anin-plane quadrupolar configuration create a low-field 
zone at the centre, where an antitube of water (yellow) is stabilized inside an 
immiscible magnetic liquid (white). b, Contour plot of the magnetic field. 

c, Synchrotron X-ray tomographic reconstruction of a water antitube (yellow) 


Antitube (water) 


larger magnetic susceptibility y (Table 1); they can withstand consider- 
able flow rates while remaining confined in the quadrupole (Extended 
Data Fig. 4a, b). A1-mm-diameter liquid tube can deliver a flow of about 
40 ml min”. In addition, the phase transfer of magnetite nanoparticles 
into water is very low, with measured iron concentrations mostly below 
1 ppm (see Methods section ‘Quantitative analysis’). 

The liquid-in-liquid design offers advantages of stability and robust- 
ness for fluid transport. Figure 2a illustrates self-healing after an 
antitube in ferrofluid was severed with a spatula. Recovery without 
applied external pressure is rapid (self-healing in Magoil is illustrated in 
Supplementary Video 2). The antitubes cannot clog: when glass beads 
are introduced (Fig. 2b), they are easily flushed out. Evenabead much 
larger than the antitube diameter can be pushed through using minimal 
pressure (20 mbar; Fig. 2c). The liquid walls of the antitube stretch to 
avoid clogging and return to their original size when the obstruction is 
expelled. A change of external pressure alters the antitube diameter. In 
Extended Data Fig. 4c, d, it can be seen that antitubes remain unchanged 
with externally applied pressure for an open outlet at atmospheric 
pressure, but dilate when the same pressure is applied with the outlet 
closed off. This can also be seen in Fig. 2c, where the tube dilates behind 
the bead to accommodate the increased local pressure. As a demon- 
stration of anti-fouling behaviour, we used laser light to covalently 
crosslink a polymer inside an antitube, and were able to remove the 
resulting solid polymer rod from the outlet of the device (Extended 
Data Fig. 6a—c, and Supplementary Video 3). 

A further advantage of liquid-in-liquid flow is near-frictionless trans- 
port with negligible pressure drop. An illustration is shown in Fig. 2d 
and Supplementary Video 4, where flow of a magnetically confined 
antitube made of honey (dynamic viscosity 7, = 10 Pas) is compared 
with honey flow in a standard tube. We observed an antitube flow of 
39.4 + 0.7 gh, about 70 times faster than the flow through a conven- 
tional plastic tube of the same diameter d=1.1mm (0.55+0.10 gh"). 
The friction inside a normal tube is quantified by its slip length, defined 
as the extrapolated distance relative to the boundary where the veloc- 
ity reduces to zero. Here, the ferrofluid acts as alubricating layer, with 
an effective slip length b at the honey/ferrofluid boundary that can 
be approximated by 


2 
b= 4) |S yt -d w 


with diameter 81 pm, surrounded by ferrofluid (blue). d, Optical end view ofa 
water antitube in ferrofluid.e, X-ray end-view cross-section from tomographic 
data at y=4 mm. f, X-ray side-view cross-section from tomographic data at 
x=1mm. Scale bars (black/white), 2mm. 


where dis the antitube diameter and Qyp is the non-dimensional flow 
rate given by 


128Qn, 
=— oh 2 
Ono id *aP/dz @) 


where Qis the flow rate and OP/0z is the pressure gradient in the 
flow direction (for derivation, see Methods section ‘Derivation of 
equations’). Experimentally, we obtain an effective slip length of 
4.3mm, meaning that under these conditions the flow is essentially 
plug-like. Under the assumption of an infinite channel, the plug-like 
velocity profile can be calculated, as shown analytically in Extended 
Data Fig. 4e and using numerical methods in Extended Data Fig. Sa—d, 
leading to a very long theoretical effective slip length of 8.1 mm. 
Remarkably, the flow rate of honey through the antitube was 1.5 times 
as fast as when it fell freely and unconfined (Fig. 2d, centre), prob- 
ably owing to competition between orifice wetting’’ and the higher 
hydrostatic pressure due to the greater height of the honey column 
in the antitube design. 

The stable confinement of an antitube, at equilibrium, results from 
the competing magnetic energy of the confining fluid and the sur- 
face energy oof the magnetic/non-magnetic interface. Inserting these 
energy densities into the magnetically augmented version of Bernoulli’s 
equation” gives the equilibrium diameter of the antitube (see Methods 
section ‘Derivation of equilibrium diameter equation’): 


#2 4o (3) 
2M H,+ UyMj 


where H,, M, are the magnetic field and magnetization values at the 
interface, {1p is the permeability of free space, and M is the field-averaged 
magnetization of the confining fluid. This equation is valid when the 
magnetic pressure, /1,H is significantly larger than any difference 
in hydrostatic pressure. It can then be linearized when M = yH, under 
the geometrical conditions w < 2h (Fig. la) andd< Yaw, typical of our 
devices, where wis the spacing between the magnets. The linear model 
gives the minimum equilibrium dimensionless diameter d* = d/w as: 


= 1/3 
| 4 
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Table 1| Characteristics of the magnetic fluids 


Magnetic liquid Susceptibility, x Viscosity,n(mPas)* Appearance 
Magoil 47x*10% 600 Transparent 
APG311 ferrofluid 0.151 70 Opaque/black 
EMG900 ferrofluid 1.67 60 Opaque/black 
MKC ferrofluid 1.88 380 Opaque/black 
MD4 ferrofluid 0.315 180 Opaque/black 


*At a shear rate of 100 s* 


where Np = H)Mzw/ois the magnetic confinement number expressing 
the ratio of magnetic to surface energies (see Methods section ‘Deriva- 
tion of linear and saturation models’), and M, is the remanent mag- 
netization of the permanent magnets. Note that a1,000-fold increase 
iny reduces d* by a factor of 100, revealing the importance of the mag- 
netic susceptibility of the confining fluid. 

We have studied how the attainable antitube diameters vary with w 
for the paramagnetic confinement liquids given in Table 1. There is good 
agreement between the experimental points for water antitubes and 
the predictions of equations (3) and (4), using measured susceptibil- 
ity x and interface energy o (Extended Data Table 1). The addition ofa 
surfactant to the water phase, to the magnetic fluid phase or to both”° 
leads to smaller antitubes since ois lowered; 1 vol% or 23 mM Span-80, in 
the ferrofluid, and1vol% or 0.58 mM Tween-20 in the aqueous antitube. 
Although the critical micelle concentrations of Span-80 and Tween-20 
in water-hydrocarbon interfaces are 1.2 mM and 0.58 mM, respec- 
tively, we did not observe surfactant-induced structure formation. 
The close agreement is shown in the plot of dimensionless diameters 
d* in Fig. 2e where plots of the experimental values versus predicted 
diameters collapse onto one curve for all ferrofluids, surface tensions 
and magnet gaps used. The smallest antitube that could be detected so 
far has d=14.+2 um, obtained when using astrong ferrofluid, EMG900 
(Extended Data Fig. 3g). Our model predicts that antitubes below1 um 
canbe stabilized, with a100-pm magnet spacing anda strong ferrofluid 
(MKC), but they are currently below the detection limit of our imaging 
methods. 

Magnetic confinement can be used to implement basic microflu- 
idic operations. To make branched antitube devices, we resort to 
out-of-plane quadrupolar fields, made by waterjet-cutting two stacked 
Nd,Fe,,B plates (Fig. 3a). The null-field line follows the channel centre 
independently of the channel angle with respect to the magnets, which 
is not the case for in-plane quadrupoles (not shown). Symmetric split- 
ting of the flow was demonstrated in a ferrofluid antitube Y-junction 
(Supplementary Video 7). Merging of the flow at a Y-junction was visual- 
ized for antitubes stabilized by Magoil. Remarkably, merging and rapid 
mixing occurs immediately after a Y-junction (Fig. 3b), similar to mixing 
in magnetically stabilized aqueous paramagnetic tubes surrounded by 
water’®. This behaviour is in contrast to the laminar flow observedina 
3D-printed microfluidic chip with the same channel size and geometry 
as the antitube (Fig. 3c). 

As our microfluidic circuits are magnetically defined, we can exploit 
the fact that structured magnetic fields can be modulated by mechani- 
cal or electrical means to impose unique and versatile control of fluidic 
devices. Valves can be constructed by moving one or two longitudinally 
magnetized bars towards the quadrupole axis. The valving magnets 
pinch off the antitube by removing the null field at the centre (Fig. 3d, e), 
thereby imposing alocal magnetic pressure, P=1.M-H, and interrupt- 
ing the liquid flow (see Supplementary Videos 8-10, which plot the 
calculated 150-mT isovolume surfaces). The burst pressure of these 
valves was tested with an Elvesys OB1 pressure-driven pump. A sin- 
gle transverse valving magnet could sustain a pressure of 125 mbar, 
whereas a dual valve (Fig. 3f) withstood 300 mbar. Pumping is an 
extension of the valving principle; pinch points of magnetic pressure 
seal off pockets of pumped liquid (for example, water) which are then 
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Fig. 2| Properties and scaling of water antitubes. a, Top-view X-ray images 
illustrating the mechanical rupture of a static tube by a spatula, followed by 
self-healing. The tube returns to equilibrium, without being under flow, 

within minutes. b, Glass beads (0.6 mm) inserted into a1.5-mm antitube 

can be expelled witha slight increase in applied pressure of 20 mbar.c, A 
2-mm-diameter bead, larger than the antitube diameter (d= 0.5mm), does not 
cause clogging. There is only asmall decrease in flow rate (from 30 pl min‘ to 
23 pI min“) when the beadis present. Ina-c, the scale bars are5mm. 

d, Comparative flow of honey under gravity (see Supplementary Video 4) 
through an antitube (left), anormal tube of the same diameter (right) and in 
free fall (centre; tube open across full diameter). Scale bar, 4cm.e, Plot of 
experimental dimensionless antitube diameters d* = d/w (where wis the 
spacing between the magnets) versus theoretical values calculated from 
equation (4), for aseries of magnetic fluids. Numbers indicate the smallest tube 
diameters in micrometres. *Water containing 1 vol% Tween-20. 'l vol% Span-80 
inthe ferrofluid in addition to1 vol% Tween-20 in the water. For ferrofluid data, 
each data point represents the average of at least two experiments, for whichat 
least five images were recorded each. For Magoil*, at least 10 snapshots were 
taken from an experimental video. Error bars are standard deviations. 


displaced by the mechanical actuation of the valving magnets. This 
can be done by sequential excitation of electro- or electro-permanent 
magnets, or a rotating array of magnetic spokes (see Extended Data 
Fig. 6d-f). A more sophisticated pump based on radially magnetized 
arc-segments fixed onto a rotor and stator, and a 3D-printed fluidic 
support, is illustrated in Fig. 3g, with magnetic field contours shown 
in Extended Data Fig. 6j. This ‘Qpump’, using MKC ferrofluid, produces 
pressures of up to 900 mbar and flow rates of 32.7 + 0.3 ml min” (Sup- 
plementary Video 12, Fig. 3h). 

Because the antitube inside the Qpump has no solid walls, we 
expected that magnetostaltic pumping would be gentler than peri- 
staltic pumping, in which a plastic tube is mechanically squeezed by 
aroller. It is known that blood pumping results in haemolysis: that 
is, shear-induced rupture of red blood cells that releases haemoglo- 
bin??°, High concentrations of free haemoglobin are cytotoxic and 
have been associated with clinical complications including an increased 
incidence of thrombosis, morbidity and mortality” *°. We therefore 
compared the effects of pumping 6-ml samples of human donor blood 
(collected on hirudin anticoagulant) at 1.5 ml min™ for 1h ina closed 
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Fig. 3 | Magnetically implemented mixing, valving and pumping. 
a, Out-of-plane magnetization configuration for a Y-junction cut ina double 
magnet sheet. b, Optical image of an aqueous antitube stabilized by Tb**- 
Magoil. Blue and pink dyes are introduced at the inlets (300 pl min”) and mix 
immediately upon contact before flowing to the outlet. c, Comparison with 
anormal microfluidic channel, in which no mixing is observed. d, e, Valving 
with one magnet (N,=1, Supplementary Video 9; d) or two magnets (NM, =2, 
Supplementary Video 10; e). f, Measured flow rate Qunder an applied pressure 
of 100 mbar at the exit port controlled by one or two valving magnets. g, Top 
view and isometric view of a magnetostaltic Qpump with rotating magnetic 
segments. The orientation of magnetization for the arc segments is radially 
outward (red) or inward (blue). h, Flow rate of water versus rotation rate, w, of 
theinner rotor. Error bars represent the standard deviations over three 
samples. 


loop, using either the Qpump or a peristaltic pump (Fig. 4a). Whereas 
for water sub-ppm transfer of magnetite nanoparticles was observed, 
for blood 285 +161 ppm (pg Fe,O, per g blood) was detected (see Meth- 
ods sections ‘Quantitative analysis of water contamination’ and ‘Blood 
pumping and analysis’). However, this did not affect haematological 
parameters suchas haematocrit and cell counts of the pumped blood, 
which were unchanged compared with the peristaltic pump (Fig. 4b and 
Extended Data Table 2). Moreover, functions of platelets that circulated 
through the Qpump were unchanged, as they responded normally to 
agonists such as thrombin-receptor-activating peptides (TRAP) in 
the widely used light transmission aggregation test (Fig. 4b). Further 
inspection of platelet morphology by scanning electron microscopy 
(SEM) confirmed that platelets were intact and normal after pumping 
(Extended Data Fig. 7a—c). In short, these experiments revealed no 
adverse effects on whole blood pumped using the Qpump, as compared 
with a peristaltic pump. 

Incontrast, a large difference was found in the degree of haemolysis 
(Fig. 4d, e, Extended Data Fig. 7d). Platelet-rich plasma (PRP) obtained 
by centrifugation of whole blood is transparent after pumping using 
the Qpump, but it is bright red after peristaltic pumping (insets, 
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Fig. 4 | Blood pumping by a magnetostaltic pump. Blood was circulatedina 
closed-loop circuit either by the Qpump (filled with 7x diluted MKC ferrofluid) 
or anormal peristaltic pump. a, Closed-cycle pumping scheme used. 

b, Blood-cell counts (RBC, red blood cells; PLT, platelets; WBC, white blood 
cells) after pumping normalized versus control blood. c, The degree of platelet 
aggregation activated by TRAP tested for PRP from blood after peristaltic 
pumping and Qpumping, compared with control blood. Light transmission is 
minimal before aggregation as platelets scatter light, but upon addition of 
TRAP the platelets aggregate and transmission increases. d, Absorption 
spectrum of PRP from blood after peristaltic pumping and Qpumping, 
compared with control blood, 25 times diluted with phosphate buffer. 

e, Plasma-free haemoglobin (PFH) concentration estimated from the 
absorbance of PRP for Qpump and peristaltic pump. Blood was used from three 
different anonymous donors (see Methods section ‘Blood pumping and 
analysis’), and bars show data from at least six measurements. Error bars show 
standard deviations. 


Fig. 4d). From the ultraviolet—visible (UV-vis) absorbance of haemo- 
globin” (Fig. 4d), the concentration of plasma-free haemoglobin (PFH) 
was determined to be 130 + 40 mg dl for the peristaltic pump and 
12+5 mg dl“ for the Qpump (Fig. 4e). The degree of haemolysis is 11+6 
times lower for blood pumped by the Qpump than for the traditional 
peristaltic pump. PFH values greater than 20 mg dI ‘are indicative 
of high haemolysis”, and previous reports on transfusion of packed 
red blood cells by means of infusion pumps showed PFH values 2-3 
orders of magnitude higher”. This indicates that magnetic pumping is 
a promising way to achieve gentle transportation of blood, or possibly 
other fragile entities such as antibodies or stem cells. Further studies 
are planned with a mouse model of extracorporeal membrane oxy- 
genation, using a Qpump instead of a peristaltic pump, to investigate 
survival rates and possible systemic responses upon ferrofluid contact. 

Magnetic control of liquid-in-liquid flow opens new possibilities for 
microfluidics, allowing new channel shapes and low-pressure cargo 
transport surpassing the current capabilities of standard methods. 
We have identified the basic physical quantities controlling the size 
of confined diamagnetic fluid circuitry and have given examples of 
key microfluidic device elements. The method promises low-shear 
flow and pumping, which is of growing importance in biotechnology 
where delicate cells, proteins and antibodies are commonly damaged 
by traditional pumps” ™. If scaled up to 5-7 | min and combined with 
a gradient-field separator™ to remove traces of transferred magnet- 
ite nanoparticles, magnetic blood pumping might be implemented 
in heart-lung machines during cardiopulmonary bypass surgery, or 
in devices for extracorporeal membrane oxygenation”. Ata more 
fundamental level, we envisage miniaturized fluidic circuits without 
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solid walls that will be scalable down to the submicrometre level. We 
can then take advantage of the versatility of magnetic control at the 
nanoscale to open the door to practical low-pressure nanofluidics*> *”. 
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Methods 


Synthesis of rare-earth oil 

Inspired by the well-known gadolinium complexes of DTPA, used as par- 
amagnetic contrast agents in magnetic resonance imaging, we prepared 
an amphiphilic complex of DTPA with the paramagnetic rare-earthion 
holmium(I11), bearing two hydrophobic side chains connected to the 
DTPA moiety through ester linkage. In particular, by reaction of DTPA 
dianhydride with the branched alcohol 2-butyl-1-octanol we obtained a 
tricarboxylic chelating ligand, which on complexation with Ho* under 
alkaline conditions afforded a stable neutral HoO-DTPA complex. The 
latter, insoluble in water, was mixed with 2-butyl-1-octanol (30 wt% of 
the latter) affording a water-immiscible homogeneous fluid (Magoil) 
with positive magnetic susceptibility. The magnetic properties of the 
neutral HoO-DTPA complex (that is, before mixing with the alcohol) were 
evaluated by NMR measurement, using the Evans method’, giving a 
value 4.40 x 107 m? mol for the molar susceptibility, which is compa- 
rable with that measured for the inorganic salt Ho(AcO),, as well as of 
the order of magnitude expected for Ho™ (ref. *’). Our paramagnetic 
oil is a transparent non-ionic liquid, slightly pink-yellowish because 
of the fluorescence of Ho(II1), and it is physico-chemically stable in 
contact with aqueous solutions for several hours. Over longer periods 
of contact, hydrolysis occurs. Overall, these characteristics make it 
different from other magnetic fluids such as ferrofluids (black colloi- 
dal suspensions of nanometre-sized magnetic particles) or magnetic 
room-temperature ionic liquids. The preparation of the oil was easily 
scaled upto 25 g. We also used different trivalent rare-earth ions, such 
as erbium and terbium (Fig. 3b), obtaining analogous results. 


Preparation of the Ho-DTPA-(2-buthyl-1-octanol),-based 
paramagnetic oil 
DTPA dianhydride (5 g, 14 mmol) was dispersed in dry dimethylforma- 
mide (100 ml) under argon atmosphere, and the suspension obtained 
was heated to 70-75 °C under stirring, until complete solubilization was 
observed. After that, 2-butyl-1-octanol (5.5 g,28 mmol, 2 eq.), dissolved 
in10 ml DMF, was added dropwise to the above solution and the result- 
ing homogeneous mixture was left under stirring at SO °C for 4h. The 
completeness of the reaction was monitored by liquid chromatogra- 
phy/mass spectrometry. Afterwards, the solvent was removed (rotary 
evaporation plus high-vacuum pump) to give 2 as a sticky yellowish 
solid that was used without further purification (Extended Data Fig. 2a). 
To a stirred 0.2 M solution of the ligand 2 in ethanol (EtOH), an 
aqueous 1M solution of NaOH (3 eq.) was added dropwise at room 
temperature, followed by the addition of HoCl,:(6H,O) (1.5 eq. with 
respect to 2), dissolved in water to produce a final 0.1M solution of 
the complex of 3 in ethanol:water 1:1. The resultant homogeneous 
mixture was left stirring at room temperature until a viscous pink oil, 
whichresponds to magnetic fields, separated out from the water phase 
(typically 0.5-1h). After that, the solution was concentrated by rotary 
evaporation to remove EtOH, and the water phase was extracted with 
dichloromethane. The organic phase was separated, dried over MgSO, 
and rotary-evaporated to give the complex 3as a glassy solid. Finally, the 
latter was dissolved in the minimum amount of dichloromethane, and 
the fatty alcohol 2-butyl-1-octanol (30 wt%) was added to the resultant 
solution. The mixture was then rotary-evaporated under high vacuum 
to eliminate dichloromethane, resulting in the target homogeneous 
transparent paramagnetic oil. The absorbance of the paramagnetic oil 
(Extended Data Fig. 2b) matches the absorption peaks of Ho™ (ref. *°). 


Measurement of the magnetic susceptibility of the 
paramagnetic oil 

The magnetic susceptibility of the neutral complex 3 (Extended Data 
Fig. 2a) was measured by the Evans method, using NMR spectroscopy*’. 
This method relies on the fact that the chemical shift of the ‘'H NMR 
signals of amolecule depend on the bulk susceptibility of the medium. 


We used the resonance line of the residual proton of CDCI, (that is, 
the NMR solvent) as a reference, by comparing its chemical shift in 
pure CDCI, and in a CDCI, solution containing the complex 3. From 
the difference inthe chemical shifts we calculated the mass and molar 
susceptibilities Of 3 (Xmass= 4-88 * 107” m? kg} Vinci = 4.40 10” m3 mol’). 


Magnetic properties of the ferrofluids 

Magnetization measurements were performed in a vibrating sample 
magnetometer with a compact variable permanent magnet source”. 
Samples were mounted in 3D-printed sample holders with cylindrical 
chambers of diameter 3 mm and length3.88 mm or 5.88 mm, ensuring 
awell-defined demagnetization factor and maximal use of the uniform 
region of the pick-up coils. 

The magnetic field dependence of the magnetization was 
assumed to be due to dispersed non-interacting spherical particles 
in anon-magnetic medium, and the data were fitted with either a 
single-component or two-component Langevin function. After cor- 
recting for non-zero magnetic field offsets, the applied field H was 
corrected for the appropriate demagnetization factor, N (ref. *”), to 
get the internal field H’. Additional magnetization measurements were 
performed by measuring the inductance of solenoids immersed inthe 
ferrofluids. The solenoid inductance was measured using an LCR bridge 
meter (Hameg 8118) at 1 kHz. See Supplementary Information section 
for Langevin and demagnetization factor equations. 


Remanent magnetization of permanent magnets 

The remanent magnetization M, of the permanent magnets used was 
measured by profiling the external field with a Gaussmeter in the zdirec- 
tionalong the central axis and fitting the resultant profile to analytical 
expressions’ (see Supplementary Information section II for equations). 


Magnetic field simulations 

Assuming magnetically transparent, uniformly magnetized perma- 
nent magnets, 3D magnetic fields can be calculated with analytical 
expressions based on the surface charge model**“’. The magnetic field 
at each point is summed over the contributions of each magnet in an 
assembly. Full expressions are presented in Supplementary Informa- 
tion section III. 


Surface tension of magnetic liquids 

Asimple home-built pendent drop set-up was used to measure the 
surface tension of the magnetic liquids. The solutions were gradually 
dispensed in a stepwise fashion through a polished 74-um-diameter 
glass capillary using a syringe pump into a glass cuvette in a3D-printed 
support. The pendent drops were imaged using a Canon EOS 50D and 
Tamron superzoom 18-270-mm lens in RAW mode. RAW images were 
converted to monochrome TIFF files, with image processing being 
carried out in Image}**, and the surface tension calculated using ‘Pen- 
dent_Drop’, an Image] plug-in”. 

The drops were gradually enlarged, with multiple images taken at 
each size until the drops broke off, and this was repeated at least six 
times. The values reported in Extended Data Table 1 are pooled mean 
and variance, calculated from at least six different droplets, for at least 
15 droplet sizes, and three images per size. Overall, between 30 and 100 
images of droplets of various sizes were used per surface tension value. 
Surface tension measurements were cross-checked witha Kruss Drop 
Shape Analyzer DSA100 and found to be in agreement. 


Method for self-consistent diameter calculation with the full 
model 

For a given set of conditions (that is, permanent magnet remanent 
magnetization, quadrupole spacing, ferrofluid nanoparticle diameter 
and volume fraction, and surface tension), dcan be calculated ina 
self-consistent manner by calculating the magnetic field and magneti- 
zation at a position d/2 with the scheme in ‘Remanent magnetization 
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of permanent magnets’, next generating a new d using equation (3), 
using this as anew input d, and looping repeatedly until the difference 
between input and output is negligible. 


Side-view optical imaging and pressure/flow measurements 
Images were acquired with either a Leica MZ16 stereo microscope 
with aLeicaIC80HD digital camera, or aNikon SMZ745T stereo micro- 
scope and a SONY colour CCD camera (1/1.8”, 20 images per second, 
1,600 x 1,200 pixels). An Elveflow OB1 pressure-driven pump and 
Elveflow FS4 flow sensors were used to set the applied pressure and 
measure the resulting flow rates in and out of the antitubes. All image 
processing and analysis was carried out in Image]**. Image threshold- 
ing was performed using Otsu’s clustering method, with the resulting 
image converted toa binary mask. Particle analysis and circle fitting was 
performed on each image, resulting in cross-sectional area, and round- 
ness/circularity. Examples of cross-sections are shown in Extended 
Data Fig. 4d. 


Optical transmission imaging 

A Zeiss Axiovert 200M inverted microscope witha 5x NA =0.15 objec- 
tive and Andor Zyla 4.2 sCMOS detector were used to image through 
3D-printed microfluidic devices. These devices consist of a main chan- 
nel 200 um high and 1mm wide, aright-angled Y-junction for two water 
inlets, a ferrofluid inlet which joins the channel at its centre, a single 
outlet, and space to place four magnets to generate a quadrupolar 
field centred inthe main channel. The device is sealed with a thin glass 
coverslip. 

To stabilize antitubes, two methods were used. (i) The channel was 
prefilled with water, and the ferrofluid was injected, gradually displac- 
ing the water inside the channel; or (ii) the channel was prefilled with 
ferrofluid, with water injected to displace ferrofluid. 

Tube diameter was estimated with Image]. Plot profiles were gener- 
ated by taking column averages of images (Extended Data Fig. 3f, g), 
followed by subtraction of the background, leading to a sharp peak 
due to the lower absorption of water. This was fitted with a Gaussian 
peak function, whose full-width half-maximum (FWHM) corresponded 
to tube diameters measured from the optical side-view technique. 


Top-view optical imaging through Magoil 

ANikon SMZ745T stereo microscope and a SONY colour CCD camera 
(1/1.8”, 20 images per second, 1,600 x 1,200 pixels) were used to image 
water tubes inside Magoil. Similar to the X-ray and optical transmission 
images, the optical images were rotated and inverted (Extended Data 
Fig. 8), and profiles of column averages were extracted for data fitting. 
The background was subtracted, and the resulting peaks fitted witha 
Gaussian function. 


X-ray transmission imaging 

A MyRay dental X-ray imaging system was adapted for imaging through 
ferrofluids. It consisted of an X-ray source (MyRay RX-DC eXtend) anda 
detector (MyRay Zen X Tl) purchased from Castebel (Waver, Belgium), 
and confined inside a box (constructed in-house) covered by lead plates. 
In general, X-ray emission at 65 kV and 6 mA with exposure time of 0.1s 
was used to obtain a good-contrast image of water antitubes confined 
in ferrofluids. 

To measure the tube diameter, first abackground measurement was 
taken with a microfluidic device (Extended Data Fig. 3a, b) fully filled 
with ferrofluid. The image processing loop (also in ImageJ) consisted 
of taking the average of 10 images, rotated to have the tube fully verti- 
cal, with the intensity values inverted (Extended Data Fig. 3c). Next a 
plot profile was generated by taking column averages (Extended Data 
Fig. 3d, red line). Then a freshly cleaned device was filled with water, 
and ferrofluid was added progressively in 50 pl steps at a flow rate of 
100 pl min”, with 10 images taken at each step. The same imaging pro- 
cess was carried out as with the background, followed by subtraction 


ofthe background plot profile, leading to a sharp peak due to the lower 
absorption of water (Extended Data Fig. 3e). This was fitted with a 
Gaussian peak function, whose FWHM corresponded to tube diam- 
eters measured from the optical side-view technique. The procedure of 
adding ferrofluid was continued until a continuous tube was no longer 
observed. For improved contrast sensitivity, additional transmission 
images were taken with a Hamamatsu C7942CK-12 flat panel sensor in 
place of the MyRay Zen X T1 (Extended Data Fig. 1a, b). 


Synchrotron X-ray tomography 

Phase-contrast synchrotron X-ray tomography was carried out at the 
X02DA TOMCAT beamline of the Swiss Light Source (SLS) at the Paul 
Scherrer Institute. The X-ray beam, produced by a2.9-T bending magnet 
ona 2.4-GeV storage ring (with ring current /=400 mA), was mono- 
chromated to 30 keV. We used a high-speed CMOS (complementary 
metal-oxide-semiconductor) detector (PCO.Edge 5.5) coupled to 
a KinoOptik 1x visible-light objective with a 300-ym-thick LuAG:Ce 
scintillator, for a field of view 16.6 mm x 14.0 mm and effective pixel 
size 6.5 um. The sample-to-detector distance was 327 mm. Rawimages 
were acquired with 50-ms exposure times across 1,500 tomographic 
projections. Twenty dark and 100 flat-field images were collected for 
each scan. 

The samples consisted of 3D-printed supports with circular 
cross-section and a1.5-mm-diameter channel. There is agroove onthe 
top and bottom of the holder to accommodate two N42 grade Nd,Fe,,B 
magnets (10 x 10 x 10 mm) on each side at a distance of 3 mm. The 3D 
print is designed so that the centre of the null field is close to the centre 
of the fluidic channel. The channels were prefilled with water contain- 
ing 1 vol% Tween-20 surfactant and imaged after sequential injection 
of known volumes of a ferrofluid with a syringe pump. 

Before further analysis, each projection was corrected with the 
respective dark and flat-field image. Computed tomography recon- 
struction was performed with the ‘gridrec’ algorithm*’, enabling fast 
reconstructions of large datasets. Tomographic slices were rendered 
using Image] 1.52i, and the volumetric cross-sections were rendered 
using Paraview 5.6.0. 


Antitube durability 

The durability of the antitubes was tested by varying the flow rate of 
water with a syringe pump to determine the threshold value at which 
ferrofluid is sheared out. For this test, we used a fluidic chip that con- 
sists of four magnets (each 9 x 9 x SO mm) with gaps of 9 mm between 
magnets. No major leakage of ferrofluid was observed throughout 
the measurement, and the threshold value was reported for the flow 
rate when iridescence due to athin film of oil on the water surface was 
observed. The observed data (Extended Data Fig. 4a, b) are valid only 
for this specific magnet arrangement and fluidic circuit, as the magnetic 
field distribution varies when the size and arrangement of magnets are 
changed, and outlets perpendicular to the main channel inhibit further 
leakage. In general, the threshold value increases when more viscous 
or magnetically stronger ferrofluids are used for antitube formation 
and decreases as the antitube diameter decreases. 


Quantitative analysis of water contamination by magnetic 
media 

The amount of magnetic media leaked into water was quantified by 
atomic absorption spectroscopy (AAS; Agilent Technologies 200 Series 
AA). The emission wavelengths of iron (372 nm) and holmium (559 nm) 
were used for ferrofluids and Magoil, respectively. The standard samples 
for iron were prepared by dissolving iron metal in hydrochloric acid and 
diluted by distilled water to obtain a series of concentrations to construct 
acalibration curve. Each point is from five consecutive measurements. 
We mixed 1 ml of ferrofluid (MKC and MD4) and 25 ml of distilled water in 
aseparatory funnel and left them overnight. We then transferred 20 ml of 
the distilled water from the funnel to a vial and removed water by using 


a rotary evaporator. Hydrochloric acid (1 ml) was added to the vial to 
dissolve any remaining iron. The iron solution was diluted to make 50 ml 
intotal and used for AAS. Similarly, we mixed 1 ml of Magoil with 25 ml of 
distilled water. Five consecutive measurements were performed for each 
sampleto build the statistics. Emission intensities of 0.049 + 0.002 and 
0.050 + 0.004 were observed from the water samples in contact with MKC 
and MD4, respectively. Using the calibration curve and error propaga- 
tion analysis, the concentration of the iron from water in contact with 
MKC and MD4 was determined as 0.78 + 0.05 ppm and 0.79 + 0.08 ppm. 

Additional checks were made by flowing water through antitubes 
of three different diameters, 3.5 mm, 1.5 mm and 0.8 mm. The cav- 
ity containing both ferrofluid and water was 6 mm in diameter. We 
used DI water as transported fluid. The surrounding ferrofluid was 
APG1141 with viscosity 5 Pas. Three flow rates were used, correspond- 
ing to 10%, 50% and 75-80% of the threshold flow rate for each diam- 
eter (Qihres= 80 ml min“, 60 ml min‘ and 30 ml min“, respectively; see 
Extended Data Fig. 4a, b). The set-up consists of a pressure-driven pump 
(Elvesys OB1) connected to a water reservoir which is then connected 
tothe antitube housed in a 3D-printed support. At the outlet, the flowis 
split into two streams. One passes directly to a tube, and the other flows 
adjacent toa magnet (6 x 6 x 50 mm) before entering a second tube. This 
additional magnet was placed with its long axis parallel to the channel 
flow and is there to capture any ferrofluid that may inadvertently shear 
out under flow. The water was flowed continuously for 1.5 h, and 45-ml 
samples were collected in each Falcon tube at the beginning, middle 
and end of the experiment. All solutions were evaporated, followed by 
the addition of 1 ml concentrated HCI, which was allowed to react for 
1hand diluted with 45 ml of ultrapure H,O. The entire procedure was 
repeated to produce a total of six samples per experimental condition, 
with the data reported in Extended Data Fig. 4f being the average values, 
and error bars being the standard deviations. Here we found that all Fe 
concentrations detected were below 0.3 ppm, with most of them within 
one standard deviation for a series of blank measurements. 

Magoil hydrolyses in a few hours, and therefore 20 ml of water was 
transferred to a vial after one hour and diluted to make a total 50 ml of 
the sample for AAS. Holmium(r11) chloride was dissolved in distilled 
water and diluted to obtaina calibration curve. The emission intensity of 
0.058 + 0.001 was observed from the water sample that was in contact with 
Magoil, which corresponds to 2.6 + 0.1ppm based on the calibration curve. 


Construction of Qpump 

The Qpump housing was 3D-printed using F170 (Stratasys) with ABS 
(acrylonitrile butadiene styrene)-M30. Nd.Fe,,B arc magnets were 
purchased from NEOTEXX (TR-036.5-33-20-NN and -SN, TR-028.5-25- 
20-NN and -SN). A high-power gear motor (RobotShop, Devantech 
24V) was used to rotate the inner rotor of the Qpump. The fluidic chips 
were 3D-printed using a Form 2 print (Formlabs) with a clear resin (FLG- 
PCLO2). The chips were then connected to Tygon tubing (Saint-Gobain, 
ND 100-65, 1/16” inner diameter (ID) x 1/8” outer diameter (OD)). 


Blood pumping and analysis 

Human blood (50 ml) froma unique donor at!’Etablissement francais du 
sang (EFS) Strasbourg was used for each set of experiments. The blood 
was collected on anticoagulant with hirudin (100 units mI’). Six 6-ml 
tubes were prepared for each experiment: one as a negative control (not 
used in a pump), three for use with a Qpump, and two for a peristaltic 
pump. To build statistics, blood from three different donors was used 
to construct the data reported here. The rotation rate of motors was 
adjusted to obtain the flow rate of each pump at -1.5 ml min”. Tygon 
tube (Saint-Gobain, Tygon 3350, 1/16” ID x 3/16” OD) with stoppers was 
used with the peristaltic pump (Fischer Scientific, CTP300). 

After an hour of blood pumping in a closed circuit (Fig. 4a), the 
whole blood from each tube was analysed with a haematology analyser 
(Sysmex Europe, XN-1000). Then the whole blood was centrifuged 
(SORVALL RC3BP, 10 min at 250 g, room temperature) to obtain PRP 


for UV-vis spectroscopy (Agilent, Cary 8454) and light transmission 
aggregometry (APACT 4004). For UV-vis spectroscopy, PRP was 25 
times diluted with phosphate buffer. Light transmission was recorded 
during 5 min while platelets aggregated after the addition of 30 pl of 
agonist (TRAP 10 uM) into 270 pl of PRP. 

To perform electron microscopy imaging of platelets, samples were 
prepared as follows: first, 100 pl of PRP was added to 400 pl glutar- 
aldehyde 2.5% (Euromedex, 16210) in cacodylate (CACO) buffer ina 
15-ml tube and incubated overnight at 4 °C. Then the solution was 
centrifuged for 8 min at 2,.000g to obtain pellet platelets, which were 
then resuspended with 2 ml CACO buffer for washing. This solution 
was centrifuged again for 8 min at 2,000g, and the collected platelets 
were resuspended with 250 ul of CACO buffer. Finally, the solution 
was diluted 10 times, and 100 ul of diluted solution was deposited on 
a poly-L-lysine (Sigma Aldrich, P8920) coverslip (Euromedex, 72226- 
01) to sediment the platelets at room temperature. After 30 min, the 
excess solution was removed, and the substrate was washed twice with 
CACO buffer for 5 min. Dehydration of the substrate was done with etha- 
nol (LPCR (Les Produits Chimiques de la Robertsau) 20821296): 70% 
(3x5 min), 85% (5 min), 95% (5 min) and 100% (2 x 30 min). The substrate 
was then desiccated with hexamethyldisilazane (LPCR, 112186.0100): 
1/4 HMDS + 3/4 ethanol 100% (5 min), 1/2 HMDS + 1/2 ethanol 100% 
(5 min), 3/4 HMDS + 1/4 ethanol 100% (5 min) and pure HMDS (2 x 5 min). 
The coverslip was fixed onan aluminium support mount (Euromedex, 
75230) with conductive carbon cement (Euromedex, 12664). The sam- 
ple was metallized (coating by ~10-nm layer of platinum and palladium) 
by aCressington 208 HR sputter coater and observed with Phenom Pro 
Desktop SEM for high-quality imaging. 

Ferrofluid contamination in the pumped blood was determined using 
the protocol in ‘Quantitative analysis of water contamination by magnetic 
media’, along witha complementary method of measuring the magnetic 
moment of the whole blood samples ina SQUID magnetometer (Quan- 
tum Design MPMS3 SQUID-VSM). First, the ferrofluid MKC was dried 
by rotary evaporation to measure the magnetic response without the 
hydrocarbon carrier fluid. Similar to the Langevin function fitting (see 
‘Magnetic properties of the ferrofluids’), the concentration of superpara- 
magnetic magnetite nanoparticles in the powder was found to be 76%. 

Next, eight whole-blood samples, four pumped through two differ- 
ent Qpumps, two pumped through a peristaltic pump, and two con- 
trols (not pumped at all), were lyophilized before mounting in gelatine 
capsules. The control and peristaltic pumped samples showed only a 
diamagnetic susceptibility at room temperature, as expected for oxy- 
haemoglobin, with an average susceptibility of -2.3+0.3Am’kg!T! 
(after correction for sample holder contributions). In these blood sam- 
ples, deoxyhaemoglobin is visible only at low temperatures, where 
a Brillouin function fit reveals an average of four unpaired spins, as 
expected. In contrast, the four magnetically pumped samples returned 
an average superparamagnetic moment of 26.2 +14.8 x10? Am’kg™, 
which is the equivalent of 285 + 161 ppm of Fe,O, per gram of blood. 


Derivation of equations (1) and (2) 

We use the steady-state Stokes equations in radial coordinate for 
ferrofluid as well as honey. The fluid landscape is divided into three 
regions, namely the honey flow (region I), ferrofluid with shear flow 
(region II) and reverse flow of ferrofluid (region III) (see Extended Data 
Fig. 5e). The solid walls are given a zero-velocity boundary condition, 
and the interface between the liquids has continuous velocity and shear 
stress. As there is no net flow of ferrofluid, the flow rate of ferrofluid 
in region III is equal and opposite to that in region II. This can be seen 
in the normalized velocity profile in Extended Data Fig. 4e which was 
also confirmed by numerically solving the Navier-Stokes equationin 
Extended Data Fig. 5a—d. Finally, solving the Stokes equations with the 
respective boundary conditions, the effective slip lengthis defined in 
terms of the flow rate (for a fixed pressure gradient) inthe system. The 
full derivation is presented in Supplementary Information section III. 
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Derivation of equilibrium diameter equation (3) 

The approach starts with the magnetically augmented Bernoulli equa- 
tion, choosing appropriate boundary conditions at two points (see 
Extended Data Fig. 5f), one at the centre of the antitube in the null field 
region, P,, and the other at the boundary between the inner solution and 
the magnetic liquid, P,, and solving the Bernoulli equation to arrive at an 
expression due to the balance of energies at the interface. A derivation 
is provided in Supplementary Information section IV. 


Derivation of linear and saturation models for equilibrium tube 
diameter 

Equation (3) is simplified to equation (4) by linearizing the magnetic field 
dependence inside the quadrupole, assuming alinear magnetic suscep- 
tibility for the ferrofluid, and inserting both results into equation (3). 
A full derivation is described in Supplementary Information section V. 


Computational fluid dynamics 

Finite element methods using ANSYS 18 were used to simulate fluid dynam- 
ics ofa honey antitube inside EMG900 ferrofluid. A full 3D simulation was 
performed, with the following boundary conditions applied: at the inlet 
we define the flow rate (Q=175 pl min”), and the outlet is set to atmos- 
pheric pressure. The interface between liquidsis defined as two walls with 
constant magnitude of shear stress (1). We vary the shear stress until the 
velocity at the interface becomes continuous. Once the shear stress and 
velocity become continuous, the solution is accepted. Here T= 0.45 Pa 
for the semi-infinite case and t= 0.47 Pa for the case in which we include 
the effect of curved inlet and outlet. The reverse flow as predicted by the 
analytical modelis clearly seen inthe simulations. The antitube diameter 
is 1.2mmand the cavity size is 4.4 mm. The fluids used are honey with vis- 
cosity 10 Pasand EMG900 (ferrofluid) with viscosity 0.06 Pas. To simplify 
the simulations, we use free slip conditions at the curved inlet and outlet 
region. This is because the change in the velocity in the curved inlet and 
outlet region is dominated solely by the change in diameter. 

When comparing an infinite channel (Extended Data Fig. 5a, b) toa 
finite channel (Extended Data Fig. 5c, d), we observe the same coun- 
terflow of ferrofluid. For the finite channel, the velocity away from 
either the inlet or outlet has a flat, plug-like profile (Extended Data 
Fig. 5d) identical to the infinite channel (Extended Data Fig. 5b), but 
at the outlet there is a large drop in the flow velocity as the volumetric 
flow rate is fixed and the channel widens. Furthermore, most of the 
pressure drop occurs at both the inlet and outlet where the channel 
diameter changes (Extended Data Fig. 5c). Additionally, we observe 
the same pressure drop inside the finite antitube away from the inlet 
and outlet, as found for the infinite antitube (Extended Data Fig. 5a). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Source data for Figs. 2,3 and 4 (and Extended Data figures containing 
data graphs) are provided with the paper. Any other data that support 


the findings of this study are available on the Zenodo data repository, 
https://doi.org/10.5281/zenodo.3603029 


Code availability 


The Python code for calculating magnetic fields is available on the 
Zenodo data repository, https://doi.org/10.5281/zenodo0.3603029 
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Extended Data Fig. 1| Adaptability of antitube topologies to arbitrary 
magnetic fields. a, b, X-ray transmission images of an antitube of water insidea 
ferrofluid for cuboid magnets (6 x 6 x 50 mm) with 6-mm gap (a); and matching 
arc magnets of height 20 mm, 3.5-mm gap, and inner and outer diameter pairs 
of:ID 25mm, OD 28.5 mm; and 1D 33 mm, OD36.5mm (b). 

c-f, Antitube cross-sections using non-quadrupolar fields: the magnetic field 


calculation (c) and experimental cross-section (d) for a four-magnet 
arrangement; and the magnetic field calculation (e) and experimental 
cross-section (f) for asix-magnet arrangement. Scale bar, 3mm. See 
Supplementary Videos 5, 6. g, Side view of a 1-m-long water antitube (d=2 mm), 
which allows light to pass throughout, showing the continuous water phase. 
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Paramagnetic Oil 


b, UV-vis absorption spectrum of Ho**-based paramagnetic Magoil. 
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Extended Data Fig. 2 | Magoil synthesis and optical absorption properties. a, Magoil synthesis reaction scheme. DMF, dimethylformamide; EtOH, ethanol. 
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Extended Data Fig. 3 | Process steps involved in X-ray and optical imaging of antitube fitted with a Gaussian peak function. f, Optical image of asub-100-m 
antitubes in ferrofluids. a, b, Typical quadrupole assemblies used for X-ray water antitube in an EMG900 ferrofluid (double surfactant). g, Intensity profile 
measurements.c, Inverted transmission X-ray image.d, Transmissionaveraged acrossthe microfluidic channel in the vicinity of the water antitube. The profile 
along the channel. e, Background-corrected transmission through the water is column-averaged along the length of microfluidic channel. 
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Extended Data Fig. 4 | Antitube stability and shear resistance under flow. 
a,b, The threshold flow rate for antitube diameters (a) and areas (b).c, Relative 
dilation of an antitube in APG311 ferrofluid with a quadrupolar gap, w, of 10 mm 
under flow and with the outlet closed. d, Side view through the antitubes under 
static conditions with no flow (outlet closed) for two magnet gaps at 
equilibrium (0 bar) and under 0.9-bar pressure. e, Normalized velocity profile 
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Flow Experiments 


inside honey using equations (1) and (2), corresponding to data in Fig. 2d. Inset 
shows velocity profile in both honey and ferrofluid. f, Concentration of Fe 
found in water after pumping through three different antitube diameters for 
three flow rates with and without an extra magnet on the outlet flow. Blank 
tests for pure water give the grey background threshold detection. Values are 
averages of six samples, error bars are standard deviations. 


Antitube 


Ferrofluid 


Ferrofluid Pressure (Pa) 


_— _—_ | 
-3.0 -2.55 -20 -15 -10 0 -2 2 6 10 14 18 


Velocity (mm s*) 


04 02 08 14 20 26 


Ferrofluid Pressure (Pa) Antitube Pressure (Pa) 


Extended Data Fig. 5| Flows in antitubes and surrounding ferrofluid using 
computational fluid dynamics. a—d, Contour plots from numerical 
simulations of a honey antitube in EMG900 ferrofluid under a flow rate of 

175 yl min” for two cases: first, the semi-infinite case with no inlet effects (a, b); 
second, the finite case including inlet effects with the ferrofluid contours 
matching those found by experiment (c, d) (compare with Extended Data 

Fig. 1a). The plots are (a, c) isometric pressure contours for at the outlets; note 
the different colourmaps for the pressure inside the ferrofluid versus inside the 
antitube; (b, d) velocity vector field at the outlets. e, f, Geometries used in the 
derivation of equations (1)-(4). e, Cross-section of liquid tube system 
considered in derivation of equilibrium diameter equation (3) (see Methods), 


55 75 95 


with four different flow regions consisting of: 1, honey; II, a parallel flow of 
ferrofluid; III, acounterflow of ferrofluid; and IV, a fictitious region to define 
the radial distance x at which the flow velocity becomes zero. Thus, the slip 
length fora flow of honey is b=x - R, where t;is the thickness of the ferrofluid, 
nis the thickness of region with shear flow and Ris the radius of the honey tube. 
f, 2D geometry of four bar magnets in a quadrupolar configuration considered 
in the derivation of linear and saturation models for the equilibrium tube 
diameter (see Methods and Supplementary Information). The hatched region 
denotes the ferrofluid, and the white region in the centre is the contained liquid 
tube. 
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Extended Data Fig. 6 | Demonstration of photo-polymerization and 
pumping functionality in antitubes.a, A scheme of the fluidic chip used for 
the extrusion of a photopolymer resin and its photopolymerization by 405-nm 
laser light during extrusion. b, c, Photos of polymerized tubes extruded in (b) 
an aqueous HoCl, solution, and (c) an aqueous MnCl, solution. The diameter 
of the tubes decreased as the magnet gap decreased, and those 
photopolymerized in Ho™ were smaller, as Ho™ has higher magnetic 
susceptibility than Mn”. d, Isometric-view of a six-spoke magnetostaltic pump 
(see Supplementary Video 11). e, Slow rotation (2 rpm) leads to pulsed flow, 
whereas fast rotation (14 rpm) produces a smoother flow. f, The average flow 
rate and standard deviation versus rotation rate w. g, The magnetic 
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configuration for an antitube (section nearer the viewer) anda pinching region 
(further section) using magnets (6 x 6 x 50 mm) witha gap of 6mm. 

h, Isosurface plot of the calculated magnetic field for g. The weak field region 
where water can reside is highlighted in white. A water antitube created by the 
quadrupolar arrangement (left half) is disrupted at the interface between two 
regions. The field created by the pinching geometry hasa field strength of 0.5T 
at the centre of magnets (represented by the dark blue colour). i, Top view of 
the Qpump based on this principle. The orientation of magnetization for the 
arc segments is radially outward (red) or inward (blue). j, x-y contour plot of the 
calculated magnetic field along the z-axis centre of the Qobump. 


Extended Data Fig. 7| Comparison of platelet and whole blood quality in change due tothe activation of platelets was observed for either type of pump 
peristaltic and magnetostaltic pumping. a—c, Representative SEM images of (peristaltic or Qpbump).d, A picture of the whole blood ina tube after peristaltic 
platelets: after peristaltic pumping (a); platelets from control blood (b); and pumping (left), control (middle) and Qpump pumping (right). 

platelets from blood after Qobump pumping (c). No major morphological 
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Extended Data Fig. 8 | Process steps for optical imaging of antitubesin 
Magoil. a, b, Optical micrographs of a water antitube in Magoil for agap width 
w=220 um (a) and w=307 pm (b).c,d, Greyscale, rotated and inverted images 
for bandarespectively. e, f, Column average profiles of gap for aandb, 
respectively. g, h, Gaussian function fits to the background subtracted 


x (um) 


profiles. Note: the 94-m antitube is thermodynamically stable, as the image 
was taken during the extrusion of the water antitube, whereas the 30-um tube 
is thermodynamically unstable. After injection, water was then extracted, 
resulting in a thinning of the tube, which at this diameter collapses into 
dropletsina matter of minutes. 


Extended Data Table 1| Physical properties of magnetic liquids used 


Sample p qd, d, >, , x On, Oweter Owween 

kg m*? nm nm % % mN mt mN mt mN m* 
APG 311 950+50 8.2+0.03 - 2.2+4.10° - 0.151 + 0.001 24.740.8 22.8408 5940.6 
EMG 900 1720490 11.340.1 540.1 9.2401 3.3401 1.67 + 0.04 22405 29.2+1.4 720.3 
Magoil 970 +50 = = - 4.7x104 + 0.5x10~4 - 5.8+0.4 


Measured physical properties of the two commercial ferrofluids (APG311 and EMG900; Ferrotec) and paramagnetic oil (Magoil): density p, nanoparticle diameters d, and d, (for bimodal size 


distributions), corresponding volume fractions ¢, and ¢, of magnetite in the suspension, initial susceptibility x, surface tension o with air, surface tension with water, and surface tension with 1% 
Tween-20 in water. 
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Extended Data Table 2 | Cell counts of whole blood 


RBC HGB HCT MCV MCH MCHC RDW-SD RDW-CV PLT&F PDW MPV P-LCR PCT IPF WBC 
unit 107 pL g/dL % fL Pg g/dL fL % 107 pl fl fL % % % 10°97 ph 
ctrl 4.07 12.3 35.5 87.2 30.2 34.6 42.8 13.2 203 11.6 10.5 28.2 0.21 33 3.58 
Qpump 1 411 123 35.7 869 29.9 34.5 42.1 13.2 169 108 102 268 017 3.3 3.44 
Qpump 2 3.99 12 34.7 87 30.1 34.6 41.4 13.1 159 114 105 269 017 3.3 3.46 
Peristaltic pump 1 3.56 10.7 304 85.4 30.1 35.2 40.6 13.1 192 119 107 29 022 42 3.84 
Peristaltic pump 2 4.06 12.3 34.7 85.5 


Comparing a control experiment, after pumping by Qpump, and after peristaltic pumping, measured by haematology analyser (Sysmex XN-1000). Each set of experiments was performed 
using blood from the same donor. RBC, red blood cells; HGB, haemoglobin; HCT, haematocrit (volume percentage of RBC); MCV, mean corpuscular volume (average RBC volume); MCH, mean 
corpuscular haemoglobin (average quantity of haemoglobin per RBC); MCHC, mean corpuscular haemoglobin concentration; RDW, RBC distribution width (-SD for standard deviation, -CV for 
coefficient of variation); PLT&F, platelet counts by fluorescence analysis; PDW, platelet distribution width; MPV, mean platelet volume; P-LCR, platelet large cell ratio; PCT, plateletcrit (volume 
percentage of platelets in the blood); IPF, immature platelet fraction; WBC, white blood cells. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
“— AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
“—! Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection For blood experiments (life sciences): SQUID magnetometer from Quantum Design MPMS3 (software: MPMS 3 MultiVu v 2.3.3.2); Light 
transmission aggregometry APACT 4004 (software: Apact LPC V 1.19.10); UV-Vis Spectrometry Agilent Cary 8485 (software: Chemstation 
B.05.04[8]); AAS Agilent 200 Series AAS (software: SpectrAA v 5.4 Pro); Haematology analyser Sysmex Europe XN-1000 (software: XN- 
Series version 1993200-52); Phenom Pro Desktop SEM (software: SW version 4.3.3) 


Data analysis Origin Pro 9.0, Excel for Mac v.15.33 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


Source data for figure(s) 2, 3, 4 (and extended data figures containing data graphs) are provided with the paper. Any other data that support the findings of this 
study are available from the corresponding author upon reasonable request. 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


x Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size For blood pumping studies to determine hemolysis 3 (anonymous) blood donors each donated 50 mL, which was used to make 6 mL samples 
for each series of experiments (see section M17 for a more detailed description). For studies to determine whether magnetite nanoparticles 
were transferred from the ferrofluid to the blood during pumping 2 anonymous donors were used. 


Data exclusions | Three Qoumps were made (using 3D printing / glued magnet arcs, see section M16). Pump number 3 made loud mechanical noises and its 
rotation was irregular. All data obtained using pump number 3 was discarded for that reason. 


Replication Blood pumping experiments were done on the same Qpumps, using different blood donors, and on different days/weeks. The results were 
always consistent. 


Randomization No randomization was used. 


Blinding No blinding was used. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 
| 


Animals and other organisms 


Human research participants 


Clinical data 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Blood donors were anonymous, and consist of male / female healthy adults that regularly donate blood to the French blood 
center (EFS). See ethics oversight below. 


Recruitment A dozen of donors come to the French blood center (EFS) in Strasbourg every day. No specific recruitment or selection was done 
for this study. 


Ethics oversight Human studies were performed according to the Helsinki Declaration. Control human samples were obtained from volunteer 
blood donors who gave written informed consent recruited by the blood transfusion center where the research was performed 
(Etablissement Francais du Sang-Grand Est). Patients were recruited 
at the University Hospital of Strasbourg and signed a statement giving informed consent (no. CODECOH: DC-2015-2368, Comité 
de Protection des personnes Est-IV no. 15/50 du 15/02/2016) that residual blood collected for classical care can be used for 
research purposes after irreversible anonymization. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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The storage of organic carbon in the terrestrial biosphere directly affects atmospheric 
concentrations of carbon dioxide over a wide range of timescales. Within the 
terrestrial biosphere, the magnitude of carbon storage can vary in response to 


environmental perturbations such as changing temperature or hydroclimate’, 
potentially generating feedback on the atmospheric inventory of carbon dioxide. 
Although temperature controls the storage of soil organic carbon at mid and high 
latitudes”, hydroclimate may be the dominant driver of soil carbon persistence in the 
tropics**; however, the sensitivity of tropical soil carbon turnover to large-scale 
hydroclimate variability remains poorly understood. Here we show that changes in 
Indian Summer Monsoon rainfall have controlled the residence time of soil carbon in 
the Ganges-Brahmaputra basin over the past 18,000 years. Comparison of 
radiocarbon ages of bulk organic carbon and terrestrial higher-plant biomarkers with 
co-located palaeohydrological records’ reveals a negative relationship between 
monsoon rainfall and soil organic carbon stocks ona millennial timescale. Across the 
deglaciation period, a depletion of basin-wide soil carbon stocks was triggered by 
increasing rainfall and associated enhanced soil respiration rates. Our results suggest 
that future hydroclimate changes in tropical regions are likely to accelerate soil 
carbon destabilization, further increasing atmospheric carbon dioxide 


concentrations. 


The size of the atmospheric CO, reservoir (around 750 Pg C at present) 
is modulated in part by exchanges with active reservoirs of biospheric 
organic carbon (BOC), such as marine primary producers, dissolved 
organic carbon, terrestrial vegetation and soils. Of these, the terres- 
trial carbon cycle remains the least well constrained’, reflecting the 
complexity of terrestrial ecosystem sensitivity to changes in atmos- 
pherictemperature®’, CO, concentrations”, nutrient availability” and 
hydroclimate’. In particular, the sensitivity of soil carbon stocks (witha 
global reservoir of 1,600 Pg C) to climate remains unresolved, hindering 
projections of carbon cycling under future climate-change scenarios. 
Soil carbon storage is a function of inputs from leaf and root detritus 
associated with net primary productivity; and outputs through lateral 
export via rivers, CO, and CH, efflux due to root respiration and micro- 
bial decomposition, and fire emissions". Soil carbon respiration rates— 
and thus turnover times—respond directly to temperature changes’. 
However, recent findings have revealed that this effect is muted inthe 
tropics, where hydroclimate variability may have a more important 
role in climate/carbon-cycle feedback**. To test this hypothesis, we 
reconstruct how hydroclimate change since the Late Glacial period 
(24-18 ka) has affected the dynamics of soil organic carbon cycling in 
acontinent-sized tropical basin. Specifically, we determine the mean 
age of terrestrial BOC—a proxy for the dynamics of the soil carbon res- 
ervoir—eroded fromthe Ganges-Brahmaputra river basin and stored 


in Bengal Fan sediments. We then compare temporal variations in 
this mean age with variations in the intensity of the Indian Summer 
Monsoon (ISM) to reveal the sensitivity of tropical soil carbon stocks 
to changing hydroclimate. 


Post-glacial climate proxy records 


Rivers integrate sediments, organic matter and dissolved constituents 
from across their drainage basins, allowing downstream archives to 
capture basin-scale changes in, for example, environmental conditions, 
sediment fluxes and organic carbon cycling”. The radiocarbon (*C) 
content of BOC laterally exported with river sediments provides an 
estimate of the integrated mean age of BOC in the basin, and, as such, 
can be used as a proxy for the dynamics of terrestrial organic carbon 
cycling. Riverine particulate organic carbon is, however, known to 
be highly heterogeneous”: it may include “C-free petrogenic carbon 
derived from rock erosion and BOC that is a mixture of fresh organic 
matter derived from, for example, aquatic productivity and leaf litter, 
as well as organic matter that is ‘pre-aged’—that is, stored in soils and 
floodplain deposits before export (for example, see ref. '°). As such, in 
addition to bulk organic carbon, we use compound-specific “C dating 
of molecular biomarkers specifically produced by terrestrial vascular 
plants: long-chain leaf-wax n-alkanoic acids (C,,_3, fatty acids). 
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Fig. 1| Correlation between palaeoclimate proxies with organic-matter age 
structure. a, b, Post-glacial precipitation 5D, values, derived from ice-volume- 
and vegetation-fractionation-corrected fatty acid 6D values (see ref. °and 
Methods for details), are plotted against the degree of pre-ageing of organic 
matter (given as FR, in dimensionless units). a, F“R values (n = 35) of individual 


To reconstruct variability in soil-carbon storage dynamics within 
the Ganges-Brahmaputra basin following the Late Glacial, we couple 
4C dating of bulk organic carbon and C,,_,, fatty acids in fine-grained 
turbidite beds from a suite of three gravity piston cores collected in 
the Bengal Fan channel-levee system (see Methods and Extended Data 
Fig. 1). These cores record basin-scale hydroclimate and vegetation 
change over a roughly 18,000-year period (Supplementary Table 1 
and Extended Data Fig. 2), notably an order-of-magnitude shift in 
the strength of the ISM between the Late Glacial/Heinrich Stadial 1 
(HS1; 18-15 ka) and the Holocene Climatic Optimum (HCO; 10.0- 
6.5 ka)®°. Combining these palaeoenvironmental signatures with 
time-varying lipid age structures from this same suite of biomarkers 
allows us to quantify the sensitivity of soil-carbon storage within this 
tropical drainage basin to changing hydroclimatic conditions over 
millennial timescales. 


Climatic control of organic carbon age structure 


Values for the fraction modern (Fm) of bulk organic carbon (see Meth- 
ods;n=89) range from 0.085 to 0.648 (Supplementary Table 2). Apply- 
ing a constant 3% relative contribution of petrogenic organic carbon 
(following modern values”; see Methods), we calculate Fm values for 
bulk BOC of 0.089-0.683. This large range primarily reflects deposition 
spanning nearly the past 18,000 years: corrected for deposition age, 
the radiocarbon age of bulk BOC since the Late Glacial at the time of 
delivery to the Bengal Fan (its ‘reservoir offset’ value; see Methods and 
ref. 8) ranged between around 1,300 and 5,850 “C years (Supplemen- 
tary Table 2). The mean bulk BOC reservoir offset—3,250 “C years—is 
nearly identical to those determined for modern Ganges-Brahmaputra 
river BOC”. C,,and weighted-average C,, 3, fatty acid homologues have 
reservoir offset values of around 640-6,150 “C years (average 2,790 “C 
years) and approximately 375-6,000 “C years (average 2,672 “C years), 
respectively (Supplementary Table 3 and Extended Data Fig. 3a). We 
find relatively close agreement between bulk BOC and corresponding 
C,,and C,,_3 fatty acid reservoir offset values; however, bulk BOC fora 
givensampleis systematically more pre-aged (thatis, older at the time 
of export owing to storage on land) than are corresponding long-chain 
fatty acids (Extended Data Fig. 3b), reflecting the contribution of more 
diverse, and older, organic matter to the bulk biospheric pool”. 
Characterized in terms of ‘reservoir relative enrichment’ (F“R; 
see Methods and ref. '®), the age structures of our bulk BOC and fatty 
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standard deviation (6D,) errors. VSMOW, Vienna standard mean ocean water; 
R?, coefficient of determination. 


acids are closely correlated with the hydrogen stable-isotope compo- 
sition of precipitation (6Dp) determined from the same compounds” 
(Fig. 1 and Extended Data Fig. 3); weaker correlations with bulk BOC 
(Extended Data Fig. 4a) probably reflect more heterogeneous source 
inputs contributing to bulk organic carbon. We can further refine 
our organic carbon age estimates by taking advantage of the results 
of ref. ”, in which numerical modelling was applied to “C signatures 
of biomarkers from a sediment core collected from the head of the 
Bengal Fan channel-levee system, which captures the past 70 years of 
sediment deposition and was used to estimate the timescales of BOC 
storage within the modern watershed. They divide the pool of C3,_5, 
fatty acids into ‘fast’ (for example, leaf litter) and ‘slow’ (for example, 
mineral soil organic carbon) cycling components. These contribute 
fatty acids cycled on decadal (average age 15 calendar years) and cen- 
tennial to millennial (average age 1,000-1,200 calendar years) time- 
scales, respectively; the latter constitute 79-83% of the long-chain fatty 
acid pool. Assuming that fresh, decadal (roughly 15-year cycling time) 
BOC has always contributed to the Ganges-Brahmaputra rivers from 
vegetation debris and soil litter at rates similar to today’s (see Meth- 
ods), we calculate millennial bulk BOC and biomarker ages for the past 
18 kyr. We find that, following deglaciation, both closely track—with an 
apparent lag of roughly 1,000 years—hydroclimate shifts determined 
from both these same Bengal Fan channel-levee cores and independ- 
ent records (Fig. 2). 

Together, these results indicate that variations in monsoon strength 
led directly to changes in terrestrial BOC mean age since the Late Gla- 
cial. Using C,, fatty acids, for which correlation is the most robust, we 
find that a10%o shift in 85D, values corresponds toa roughly 0.1 change 
in millennial F“R. This represents, for example, a change in bulk BOC 
mean age of 2,600 “C years between HS1 and the HCO, corresponding 
toadecrease in terrestrial BOC storage times of more than 50%. Given 
the concurrent transition of terrestrial vegetation from dominantly 
C4 (savanna-like) to dominantly C3 (forest-like)®, which would prob- 
ably increase overall soil carbon storage capacity, this change in BOC 
storage time suggests a direct, positive influence of the warmer and 
wetter conditions within the Ganges-Brahmaputra basin on soil carbon 
turnover rates. 

Given modern-system age distributions”, it is not possible to produce 
C,, FR values of less than about 0.80, even by increasing the contribu- 
tion of the millennial BOC component to 100%. Thus, the only means 
by which to achieve the F“R values observed throughout the deglacial 
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Fig. 2 | Temporal variations in climate forcings and soil organic-carbon 
storage. a—e, Comparison of: post-glacial wet-extraction atmospheric CO, 
concentrations from Dome Fujilce Core* using the gas-age profile (a); ISM 
strength as given by acomposite Chinese cave speleothem 50 record* (b), 
by 5°0 of planktonic foraminifers (Globigerinoides ruber) (n= 81) (c) and 
calculated as precipitation 6D, from terrestrial biomarker 5D from the Bengal 
Fan channel-levee (n= 30) (d); and records of the age structure of bulk 
biospheric organic carbon (red circles; n=116) and biomarkers (C3, fatty acids; 
blue squares; n= 9) exported from the Ganges-Brahmaputra basin (e). Datain 


and Holocene period is to increase the age of the millennial component 
of BOC. This is supported by the high abundance of fatty acids during 
warm, wet periods (Extended Data Fig. 5) and by moderate correla- 
tions between fatty-acid abundance and each bulk and biomarker F“R 
(Extended Data Fig. 6). This reflects the export of biomarker-lean (that 
is, degraded), old BOC during cooler, drier periods, and implies that 
our Late Glacial and HS1 millennial BOC age estimates are conserva- 
tive: given any contribution of decadal BOC during these periods (that 
is, adecadal-to-millennial BOC ratio of more than O), then an even 
older millennial BOC contribution is required to produce such low 
BOC FR values. Our data thus indicate enhanced (with respect to mod- 
ern) pre-ageing of BOC within the Ganges-Brahmaputra basin during 
the cooler, drier periods associated with a weak ISM. The subsequent 
post-Late Glacial and pre-HCO decreases in the age of the millennial 
component of the BOC delivered to the Bengal Fan correspond directly 
to warming and moistening of the basin (Fig. 2). 


Implications for the global carbon cycle 

The old apparent age of BOC laterally exported to the Bengal Fan sug- 
gests that the modern Ganges-Brahmaputra basin stores and exports 
large quantities of BOC that accumulated over centuries to millen- 
nia”’. Basin-wide, climate-driven changes in this post-deglaciation age 
structure track not soil carbon provenance (see Methods), but rather 
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c, dare replotted from ref. °) with our updated core age-depth model. Datain 
eare given as the difference between the age of the millennial component of 
biomarkers/bulk BOC and the deposition age (in “C yrs). Solid lines inc, eare 
three-point moving averages. Pre-1950s Ganges-Brahmaputra river dataine 
are fromref.”. Vertical error bars indicate propagated measurement 
uncertainties (reservoir offset values) or multimeasurement standard 
deviation (8D) errors for instrumental measurements; horizontal error barsin 
c-e are core age-model deposition-age error estimates (see Methods). 


the residence time of that soil carbon within the terrestrial environ- 
ment, at least over millennial timescales. Thus, the decrease in millen- 
nial BOC mean age from the Late Glacial to post-HS1 periods (Fig. 2e) 
corresponds to an acceleration in soil carbon turnover (a decreasing 
mean age) by a factor of nearly two. This would have resulted from 
enhanced soil organic carbon respiration, providing a positive feedback 
on atmospheric CO, concentrations, with a partial reversal in recent 
millennia following the HCO. 

Considering only the post-Late Glacial period reveals another fea- 
ture of the observed soil carbon-climate feedback. Given that BOC 
delivered to the coastal ocean represents only around 3% of total 
annual soil-carbon turnover within the modern Ganges-Brahmapu- 
tra basin (the remaining 97% is attributed to CO, fluxes to the atmos- 
phere through heterotrophic respiration and fires*), accounting for 
the observed 4,100 “C-year change in millennial BOC age through 
lateral carbon fluxes (erosion and fluvial export) between HS1 and 
the Balling—Allergd, without a commensurate increase in soil respi- 
ration fluxes, would require a nearly twofold increase in lateral BOC 
fluxes. This same change requires only a 7.5% increase in soil respiration 
fluxes (see Methods). Although our data do not allow us to distinguish 
between these mechanisms, the dependence of soil organic carbon 
turnover on hydroclimate is best explained by an imbalance between 
increasing organic carbon heterotrophic respiration and net primary 
productivity. The observed 1,000-year lag in millennial reservoir offset 
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values as compared with hydroclimate proxy records (Fig. 2) probably 
represents the system response time: an instantaneous shift in soil turn- 
over will take centuries to shift the overall age structure of soil carbon 
and associated laterally exported organic carbon. Moreover, correla- 
tions are much weaker between the millennial age structure of organic 
carbon and atmospheric CO, concentrations (Fig. 2a) than with direct 
measures of hydroclimate (Fig. 2b-d). This suggests that soil fertiliza- 
tion (increase in productivity related to increasing CO,) may not exerta 
first-order control on soil organic carbon turnover, but rather is filtered 
through the (hydro)climate system. This implies that, in the tropics, 
the climate/carbon-cycle feedback is not simple and one-directional, 
but rather is responsive to the impact of CO,-associated warming on 
terrestrial precipitation patterns, confirming at the millennial scale, 
andina globally important system, observations from subdecadal-scale 
arid-climate macrocosm experiments”. 

The Ganges-Brahmaputra rivers are globally important in terms 
of sediment load (1-2 x 10° t yr™; ref. 7), carbon burial (roughly 
4.7 x10” g yr“, around 20% of the global terrestrial BOC burial flux; 
refs. 3), and marine sequestration of terrigenous organic carbon 
at timescales of thousands to millions of years°””*. Our results dem- 
onstrate that warmer, wetter conditions associated with post-glacial 
strengthening of the ISM led to release of pre-aged BOC from within 
this basin. Although it is not clear that increased atmospheric CO, 
concentrations (and associated climatic warming) directly lead to 
increased ISM rainfall intensity”, we find an apparent positive feed- 
back between hydroclimate and atmospheric CO, concentrations, 
acting through changes in the soil carbon stock over millennial time- 
scales, most likely triggered by enhanced heterotrophic respiration. 
In this manner, we find alow-latitude climate feedback similar to that 
reported for high-latitude permafrost soils”’. This supports findings 
from carbon-cycle models of glacial gross primary productivity and 
terrestrial carbon stocks, which suggest overall turnover times and 
associated large ‘inert’ terrestrial carbon stocks (in soils, permafrost 
and wetlands) nearly doubled during the last glacial period”’. Our study 
reveals that tropical soils may have contributed substantially to this 
glacial ‘inert’ terrestrial carbon pool— previously attributed nearly 
exclusively to permafrost”. Our results further show that future hydro- 
climate change—in which decreases in the strength of the ISM are likely 
to be more than compensated by increased atmospheric moisture 
content, leading to more overall precipitation”®”*—is likely to further 
accelerate soil organic carbon destabilization (turnover rates and/or 
net loss of carbon stocks) in the tropics, increasing atmospheric BOC 
fluxes through enhanced soil decomposition or transport-induced oxi- 
dation, and thus further increasing atmospheric CO, concentrations. 
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Methods 


Sediment cores 

Three gravity piston cores were retrieved at water depths of 2,540- 
2,610 m along the middle fan channel-levee system (inner-levee, 
outer-levee, and levee-proximal) during the February 1994 (Expe- 
dition SO93) cruise of the R/V SONNE~, coincident with sediment 
echosounder data (Extended Data Fig. 1; see ref. ° and its associated 
supplementary information for details). Sediment core age models 
are updated from ref. ° and based ona recalibration of 22 reported°” 
radiocarbon (*C) ages derived from planktonic foraminifera 
(Globigerinoides sacculifer and Globigerinoides ruber) and accelerator 
mass spectrometry (AMS) radiocarbon dating of planktonic G. ruber 
from an additional four sedimentary units within core SO93-118KL 
(Supplementary Table 1). Radiocarbon ages were calibrated using OxCal 
4.2 (ref. *) with the IntCal13 curve™ and a regionally averaged marine 
reservoir correction (AR) of —4 + 42 years (data from refs. **°). Age 
models were calculated in the age-depth modelling software package 
Clam 2.1 (ref. *”) with extrapolation to core bottoms (Extended Data 
Fig. 2). Together, these capture the last 17.5 kyr of deposition, with 
linearized sediment accumulation rates of less than50 cmkyr“‘to more 
than 500 cm kyr‘ (see ref. ° for details). 


Bulk organic carbon radiocarbon analysis 

Following fumigation acidification, radiocarbon analysis was com- 
pleted on powdered bulk-sediment aliquots from 89 samples (each 
representing 1-10 cmthick core sections). Samples were fumigated by 
sealing ina vacuum desiccator with a beaker of 50 ml 12N HCI, exposed 
to HCI fumes for 60-72 hat 60-65 °C to remove carbonates, and dried 
in a separate desiccator for an additional 24 h. Radiocarbon analyses 
were conducted at either the National Ocean Science Accelerator Mass 
Spectrometry (NOSAMS) facility (Woods Hole, MA) or with the ETH 
Zurich AMS ‘MICADAS system (see ref. ** for details). Detailed results 
are provided in Supplementary Table 2. 


Compound-specific radiocarbon analysis 
Fatty acids derived from lipid extracts from a subset of nine samples 
were purified by aminopropyl-silica-gel column chromatography, 
derivatized with methanol of known isotopic composition to pro- 
duce fatty acid methyl esters (FAMEs), and then further purified by 
aminopropyl-silica-gel chromatography and silver nitrate-silica-gel 
chromatography (see ref. ° for details). Six individual, saturated, 
long-chain FAMEs (n-C,,, N-C3,, N-Cy, N-C39, N-C, and n-C,,) were purified 
and collected according to the preparative capillary gas chromatog- 
raphy (PCGC) method’””’. This was done through a suite of up to 100, 
3-5 plinjections of asaturated FAME solution in iso-octane (individual 
homologues at 50-200 ng pI concentrations) on either an Agilent 
7890A or an HP 5890 Series II gas chromatograph (GC) coupled to a 
Gerstel preparative fraction collector (PFC). PCGC traps were recov- 
ered with 4 ml of DCM and further purified to remove column bleed 
by 1% deactivated silica-gel column chromatography, using around 
3cmof gel and eluting FAME homologues in4 ml of DCM. Homologue 
purity was checked by injecting a small aliquot on a GC-flame ioniza- 
tion detector (FID), and yields were determined to a range between 
around 40% and 80%. To increase sample size and reduce analytical 
uncertainty during radiocarbon analyses, it was necessary to combine 
homologues following isolation by PCGC for two samples: (1) for sample 
S$093-120KL1,130-1,140 cm, two sets of homologues were combined: 
n-C,,and n-C,, (treated as a C,, sample for purposes of data analysis) 
and n-Cyg, N-C39 and n-C;3, (treated as a C,, sample); and (2) for sample 
S093-120KL 632-642 cm, n-C;, and n-C,, homologues were combined 
(treated as a C,, sample). 

PCGC-purified FAME homologues were dissolved in DCM and 
loaded into premuffled quartz tubes. Solvent was evaporated under 
a high-purity nitrogen stream and combusted copper oxide (roughly 


150 pg) was added to the quartz tube. Quartz tubes were evacuated 
to less than 30 pTorr and flame-sealed on a vacuum line following 
chilling using dry ice/isopropanol slurry and liquid nitrogen cryogenic 
traps to prevent sublimation of FAMEs. Samples were combusted in 
flame-sealed quartz tubes at 850 °C for 5h, and, the following day, 
were cracked under vacuum; the released CO, was cryogenically puri- 
fied and manometrically quantified. The sample CO, was trapped using 
liquid nitrogen, flame-sealed in a pyrex tube, and sent to either ETH 
Zurich or NOSAMS where they were analysed for “C content as a gas 
target with the Micadas gas ion source as described** (ETH Zurich), 
or asa graphite target following catalytic conversion as described*™ 
(NOSAMS). 

Resulting “C measurements were corrected for procedural blanks 
(the blank correction varies among the samples, depending on the mass 
of the measured fractions; see Supplementary Table 3), and for the addi- 
tion of derivative carbon during methylation with MeOH witha known 
4C composition, using mass-balance equations (see ref.” for details). 

Atotal of 40 “C measurements were made on homologues (or com- 
bined homologues) from the 9 samples selected for analysis. Not all 
homologues were measured from all samples owing to insufficient 
compound availability or loss during sample preparation (for example, 
shattering of quartz tubes). Radiocarbon data are expressed as the 
fraction of modern carbon, Fm, precorrected for mass-dependent 
fractionation using the °C/”C ratio measured online. 

We observed relatively small differences in stable-isotope (6D)° and 
radioisotope (Extended Data Fig. 3a and Supplementary Table 3) com- 
positions among C,,_,, fatty acid homologues. Fatty acids generally, 
although not universally, become more “C-depleted with increasing 
chain length, aphenomenon commonly associated with the selective 
preservation of long-chain homologues’. C,, fatty acids—which were 
consistently the most abundant homologue (average concentration 
0.64 pg g') and yielded the highest average C mass for “C analysis 
(48.8 pg)—and weighted-average C,,_,, fatty acids are the focus of dis- 
cussion. 


*C deposition ages 

Sediment deposition ages (in “C years) are determined from “C analysis 
of planktonic foraminifera (G. sacculifer and G. ruber; Supplementary 
Table 1), corrected for a standard 400-year surface ocean “C reser- 
voir age offset. Deposition of 15 of our 118 samples is directly dated; 
the remaining deposition ages (with errors) are calculated through 
interpolation from “C age-depth models constructed in the Clam 2.1 
software package”. 


Radiocarbon terminology, conventions and calculations 

Sample “C contents are given in the nondimensional ‘fraction modern’ 
(Fm) notation—a measurement of the deviation of a sample *C/?C 
ratio from 95% of the radiocarbon concentration at ‘modern’ (1950 CE) 
of NBS oxalic acid I (SRM 4990B, OX-l), normalized to a 6°Cypp, value 
of -19% (see refs. ** *°). Given that Fm is a measure of the “C/"C ratio, 
it is, by definition, a measure of the radiocarbon content of a sample, 
and can thus be related to the radiocarbon (uncalibrated) age of the 
sample as follows: 


4C age =— 8033 x In(Fm) 


Here, we seek to determine the amount of ‘pre-aging’ that organic 
matter has undergone before export and delivery to the coastal ocean, 
as ameans of estimating soil organic carbon turnover rates. To accom- 
plish this, we use the ‘reservoir offset’ metric, defined’* as a measure of 
the radiocarbon age offset, in“C years, between two contemporaneous 
carbon reservoirs. This metric relies on the ‘relative enrichment’ of 
the reservoir, defined’ as the ratio of Fm values from two contempo- 
raneous carbon reservoirs (x and y) at a given time (in our case, the 
time of deposition in the Bengal Fan). It is thus a measure of the age 
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(in “C years) of organic matter at the time of its deposition. Relative 
enrichment is given by the notation F“R in dimensionless units, as 
follows: 


Fm 
14 = x 
Fores Fm, 


By convention, F“Ris dimensionless and ranges from 0 to 1 by placing 
the more commonly enriched reservoir (y) inthe denominator. From 
this, the reservoir age offset (R,_,) can be determined: 


Ry-y =~ 8033 x In(F4R,_y) 


In order to calculate reservoir age offset values through time, it is 
necessary to ‘uncalibrate’ organic-matter deposition ages derived from 
the core-age models (derived from calibrated “C ages of foraminifera, 
and plotted in calibrated (or ‘calendar’) years before present (BP), where 
‘present’ is set by convention tothe year 1950 CE). For this, we take advan- 
tage of the Bayesian ‘uncalibration-convolution process’ approach of 
ref. ”, which propagates uncertainties linked to the reservoir-derived 
*C age (in this case, bulk organic matter or biomarker Fm values), a 
weakly a priori known calendar age (in this case, derived from core-age 
models), and the atmospheric calibration curve (from IntCal13; 
ref. **). Specifically, we apply to our biomarker, bulk organic carbon, 
and bulk biospheric organic carbon ages the radcal script of the ResAge 
Package”, designed for pairs of reservoir-derived radiocarbon ages 
and a corresponding weakly known (with some uncertainty) calendar 
ages. This program uncalibrates age-model-derived deposition ages 
(that is, it converts these to “C years) and calculates reservoir offset 
and FR values for each input. 


Determination of millennial BOC ages 

FR values for each bulk and fatty acid homologue sample can be 
expressed as a function of the age (in Fm) of each the millennial (mil) 
and decadal (dec) pools, multiplied by the fractional contribution of 
each of those pools. 

We hold the contributions to the overall Ganges-Brahmaputra 
organic-carbon pool of each of the decadal BOC (F,,.), millennial BOC 
(F,,i), and petrogenic organic carbon (F,,-) constant through time. Spe- 
cifically, petrogenic carbon is assumed to contribute 3% of all organic 
carbon to the bulk carbon pool (BOC is roughly 97% of total organic 
carbon), whereas fatty acids are free of petrogenic carbon (BOC roughly 
100%); this assumption is founded in analyses of each river and Bengal 
Fan core-top sediments, which reveal that Ganges-Brahmaputra river 
sediments delivered to the Bay of Bengal at present contain roughly 
0.03% petrogenic organic carbon"”**’, In addition, structural character- 
ization (Raman and high-resolution transmission electron microscopy) 
of petrogenic organic carbon in sediments from the Bengal Fan dating 
back to the Miocene“, coupled with evidence’ for high burial efficiency 
of terrestrial organic carbon over the past 20,000 years, suggests that 
the petrogenic organic-carbon concentration in Bengal Fan sediments 
has remained constant since the Last Glacial. Finally, we would antici- 
pate that, if petrogenic contributions changed in any systematic man- 
ner through time, it would be reflected in bulk total organic carbon 
(TOC) values; indeed, bulk TOC shows no correlation either with the 
relative enrichment of bulk organic-carbon radiocarbon (R?= 0.0036; 
Extended Data Fig. 7a), or with petrogenic-contribution-corrected BOC 
relative enrichment (R? = 0.0025) or millennial BOC relative enrich- 
ment (R?= 0.0024). 

Given these constraints, we apply time-invariant millennial and dec- 
adal contributions to BOC of 80% and 20%, respectively (values for 
modern Ganges-Brahmaputra rivers are from ref. ”). The fatty-acid 
decadal pool is assumed to always turnover quickly (by definition), 
and is assigned a constant 15-year cycling time (F“R = 0.9981)”. The 
4C age of the millennial BOC is calculated at each time step as follows: 


PoRGe = FAR nit * Fil * EP Rag: * Fcc y FAR xc "s Fixc 


Constraining grain-size and palaeovegetation effects 

Sediment organic-carbon concentrations are highly sensitive to sedi- 
ment grain size, which, within the Ganges-Brahmaputra rivers and 
Bengal Fan, shows close correspondence with the production of alu- 
minosilicate clays, and thus sediment AI/Si content®””455°, However, 
neither the grain size (Al/Si) nor the grain-size-normalized TOC values of 
the Bengal Fan channel-levee cores show any discernible trend through 
time®. Further, we find no correlation between organic-matter age with 
respect to deposition time (that is, F*R values) and either TOC or Al/Si 
values (Extended Data Fig. 7). 

Changes in vegetation since the Last Glacial were documented 
inref. ° using bulk and biomarker 8“C values from the same cores used 
here. These changes were accounted for in ref. ° in the development 
of the biomarker 8D, record (derived from the lipid 6D record); we 
directly used those same biomarker-derived 6D, values here: a subset 
is presented in each of Fig. 1 and Extended Data Fig. 4, and the full 8D, 
record (that is, lipid 6D values corrected for ice volume and palaeoveg- 
etation (from lipid 6°C values)) is given in in Fig. 2. 

Corrections of lipid 5D values for ice-volume and lipid vegeta- 
tion are described in full in ref. °, and summarized briefly as follows. 
Methanol-corrected 8D values from fatty-acid methyl esters (6D,,) 
were corrected for variations in seawater isotope composition 
related to global ice-volume variations during the deglaciation using 
a record of global relative sea-level changes* to infer variations in the 
oxygen-isotope composition of sea water (5'8O.y) related to varia- 
tions in global ice volume across the deglaciation. Relative sea-level 
variations were converted to variations in 6°O,,, using a scaling factor 
of 1.0% per 127.5 m of relative sea-level change™. Variations in 6 °O.w 
were subsequently converted to variations in 6D,, using the slope of 
the global meteoric water line. Finally, ice-volume-corrected FAMES 
5D values (6D,,.,y) were calculated by correcting measured 6D,, for 
changes in 6D,y relative to the modern seawater composition. 6D,y. 
values were then corrected for variable D/H fractionation by each of 
C3 and C4 vegetation endmembers to estimate precipitation 6D val- 
ues (8D,). Estimated D/H fractionation factors (€) of -125%o (100% C3 
trees vegetation; €,,) and —145%o (100% C4 grasses vegetation; €,,) 
were applied on the basis of variable C3 and C4 mixing proportions 
determined from sample FAMES 8*C values following ref. ©. 


Constraining soil carbon fluxes 

Under steady-state conditions, the soil carbon turnover time (7,) canbe 
given asa function of the total mass of soil carbon (M,) and soil carbon 
inputs (NPP (F,)) and outputs (F,,,,), where F,,,, is a combination of fluxes 
from heterotrophic respiration (F,), fire emissions (F,) and river erosion 
and export (lateral fluxes, F,) as follows: 


M, M 


= Ss 
sa ay () 

We first assume that, since the Last Glacial, F,,,, responds only to 
changes in F, and F,. Following the Last Glacial, and through the HCO, 
climate in the Ganges-Brahmaputra basin gradually becomes wetter, 
and vegetation trends towards greater dominance by C, forests, as 
opposed to C, savanna’. Together, these would be far more likely to 
decrease, as opposed to increase, fire intensity, and thus fire carbon 
emissions. Thus, holding F; constant presents a conservative esti- 
mate of changes in M, and/or F, and F, between the Last Glacial and 
post-HS1 periods. We also assume that F, and F, remain at their present 
relative proportions of 97% and 3%, respectively, of the non-F,;portion 
of F,.,,. Finally, we assume that soil (millennial) carbon always com- 
poses 80% of total carbon burial (see text for details) and that carbon 
burial efficiency (sediment-volume and grain-size-normalized 


burial rates) have remained near 100% since the Last Glacial (discussed 
inref.°). 

Our bulk organic-carbon radiocarbon results indicate that the 
reservoir-age offset (proxy for T,) of millennial (soil) BOC pools 
decreased by around 3,800 “C years (approximately 56%) between 
HS1and the HCO (Supplementary Table 2). Although changing either 
F, Or Fou, would affect the steady-state turnover time (as per equation 
(1)), it has no immediate impact on the average age of soil organic car- 
bon, and thus the average BOC reservoir offset value. Thus, observed 
post-Last Glacial shifts in laterally fluxed BOC are best explained by 
changes in Ganges—Brahmaputra soil carbon stocks over this time 
(thatis, time-varying M,). For example, holding F, and F,,, constant, we 
would expect that M, would vary between approximately 33.2 Pg C at 
HS1, to roughly 14.4 Pg C at the HCO (a depletion of soil carbon stocks 
of roughly 56%), and to its modern documented value of 19.52 Pg C 
(see ref. *). Likewise, between HS1 and the post-Bglling-Allergd millen- 
nial organic-carbon reservoir offset minimum (around 2,500 “C years), 
basin-wide carbon stocks decreased by an amount (about 20.9 Pg) 
nearly equivalent to the modern basin-wide storage. 

Given that BOC delivered to the coastal ocean represents only 
around 3% of total annual soil-carbon turnover within the Ganges-— 
Brahmaputra basin (about 0.0036 Pg yr; refs. 4°), the HS1 to 
post-Bolling—Allergd decrease in basin-wide soil carbonstocks, if driven 
entirely by changes in river carbonexport, would require lateral fluxes to 
nearly double for this roughly 3,000-year period. By contrast,amere 7.5% 
increase in heterotropic respiration rates would be required to account 
for the observed decrease in soil carbon stocks between HS1 and the 
Bolling—Allergd. 


Constraining impacts of soil carbon provenance 

Changes in the age structure of BOC laterally exported from the 
Ganges-Brahmaputra rivers cannot be explained by sediment-source 
variability from within-basin soil profiles (for example, because of ero- 
sion of deeper, older carbon during cooler and drier periods, potentially 
inresponse to reduced vegetation cover). Since deglaciation, sediments 
exported by the Ganges-Brahmaputra rivers have become gradually 
more chemically weathered°™, whereas the BOC exported with these 
same sediments has become younger; thus, sediment ‘age’ (as given 
by pre-export weathering) and BOC age structures are decoupled. 
Changes in BOC age structure also cannot be explained by changes in 
sediment provenance from within the basin (for example, by shifting 
foci of monsoon precipitation associated with changes in monsoon 
strength®). Whereas sediment may be sourced from throughout the 
Ganges-Brahmaputra basin, BOC delivered to the Bengal Fan is pre- 
dominantly floodplain in origin, as upland organic carbon is subject 
to efficient degradation and replacement during transit. BOC age 
structures should thus be insensitive to sediment sourcing upstream 
of the floodplain. 


Data availability 


All new data produced for this study are from samples from Ben- 
gal Fan cores SO93-117KL, -118KL and -120KL. These are available in 
Supplementary Tables and online in the EarthChem Library (http:// 
www.earthchem.org/). Specifically, these present a compilation of 
radiocarbon age-dating results from planktonic foraminifera used 
in derivation of core age-depth models (Supplementary Table 1 and 
https://doi.org/10.1594/IEDA/111486); results of radiocarbon analyses 
of bulk organic carbon and calculated reservoir offset and F“R values of 
bulk and millennial BOC (Supplementary Table 2 and https://doi.org/ 
10.1594/IEDA/111487); results of radiocarbon analyses of fatty-acid 
homologues and associated calculated reservoir offset and F“R val- 
ues of bulk homologues and the subset of those cycled on millennial 


timescales (Supplementary Table 3 and https://doi.org/10.1594/ 
IEDA/111488); and abundances of fatty-acid homologues (Supplemen- 
tary Table 4 and https://doi.org/10.1594/IEDA/111489). Source data 
for Figs. 1, 2 and Extended Data Figs. 2-7 are provided with the paper. 
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® Check for updates 


Inrecent decades, intensive research on non-avian dinosaurs has strongly suggested 


that these animals were restricted to terrestrial environments’. Historical proposals 
that some groups, suchas sauropods and hadrosaurs, lived in aquatic environments”” 
were abandoned decades ago* ©. It has recently been argued that at least some of the 
spinosaurids—an unusual group of large-bodied theropods of the Cretaceous era— 
were semi-aquatic’®, but this idea has been challenged on anatomical, biomechanical 
and taphonomic grounds, and remains controversial? “. Here we present 
unambiguous evidence for an aquatic propulsive structure in a dinosaur, the giant 


theropod Spinosaurus aegyptiacus 


712. This dinosaur has a tail with an unexpected and 


unique shape that consists of extremely tall neural spines and elongate chevrons, 
which forms a large, flexible fin-like organ capable of extensive lateral excursion. 
Using a robotic flapping apparatus to measure undulatory forces in physical models 
of different tail shapes, we show that the tail shape of Spinosaurus produces greater 
thrust and efficiency in water than the tail shapes of terrestrial dinosaurs and that 
these measures of performance are more comparable to those of extant aquatic 
vertebrates that use vertically expanded tails to generate forward propulsion 

while swimming. These results are consistent with the suite of adaptations for an 
aquatic lifestyle and piscivorous diet that have previously been documented for 
Spinosaurus’*", Although developed toa lesser degree, aquatic adaptations are also 
found in other members of the spinosaurid clade*”’, which had a near-global 
distribution and a stratigraphic range of more than 50 million years”, pointing toa 
substantial invasion of aquatic environments by dinosaurs. 


Detailed anatomical and functional studies, combined with abundant 
trackways, all point to a strictly terrestrial ecology for dinosaurs}, 
with one clade (Maniraptora) taking to the air”. Dinosaurs are not 
currently thought to have invaded aquatic environments, follow- 
ing the abandonment-several decades ago*°—of century-old ideas 
of semi-aquatic habits in sauropods and hadrosaurs”’. Potential 
semi-aquatic lifestyles have recently been hypothesized for a small 
number of dinosaurs'®*’. However, the only group of dinosaurs for 
which multiple plausible lines of evidence indicate aquatic adapta- 
tions are the spinosaurids, large-bodied theropods interpreted as 
near-shore waders that fed on fish along the margins of (rather than 
within) bodies of water!©”°, 

Arecent study’ of the largest known spinosaurid, S. aegyptiacus, 
identified a series of adaptations consistent with a semi-aquatic life- 
style, including reduced hindlimbs, wide feet with large flat unguals, 
long bones witha highly reduced medullary cavity, anda suite of cranial 


features (such as retracted nares, interlocking conical teeth anda 
rostromandibular integumentary sensory system). This interpreta- 
tion has been challenged on the basis of taphonomy’, biomechani- 
cal modelling” and anatomical concerns’. Locomotion in water is a 
major point of contention”, because no unambiguous evidence for 
a plausible mode of propulsion has been presented. Furthermore, 
our understanding of the anatomy and ecology of this highly derived 
theropod has been hampered because only one associated Spino- 
Saurus skeleton exists, with all other associated remains having been 
destroyed in World War II’. The posterior portion of the skeleton and 
the caudal vertebral series in particular, which has the potential to 
shed light on likely adaptations for aquatic locomotion, has until now 
been poorly understood”. Consequently, the tail anatomy and func- 
tion of Spinosaurus has been reconstructed on the basis of highly 
incomplete remains and potentially spurious comparisons with other 
similar-sized theropods. 
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Fig. 1| Reconstructed skeleton and caudal series of FSAC-KK 11888. 
a, b, Caudal series (preserved parts shown in colour) in dorsal view (a) and left 
lateral view (b). c-e, Reconstructed sequential cross-sections through the tail 
show proximal-to-distal changes in the arrangement of major muscles. 


Here we describe a nearly complete and partially articulated tail of 
a subadult individual of S. aegyptiacus (accession code Faculté des 
Sciences of Casablanca University (FSAC)-KK 11888), from the Creta- 
ceous Kem Kem beds of south-eastern Morocco (Figs. 1, 2, Extended Data 
Figs.1-4, Supplementary Information section1, Supplementary Video 1). 
The skeleton represents, to our knowledge, the most complete dinosaur 
known from the Kem Kem beds”” and the most complete skeleton 
of a Cretaceous theropod known from mainland Africa (Supplemen- 
tary Information section 2). As we show here, the tail forms part of the 
neotype of S. aegyptiacus’ and was found in direct juxtaposition to the 
remainder of the skeleton (Extended Data Fig. 3). The newly recovered 
material confirms the previous conclusion’ that a single subadult indi- 
vidual is preserved at the site; over 90% of the new material was recov- 
ered during field excavations in late 2018, and then digitally recorded 
(Extended Data Figs. 1-4, Supplementary Information sections 2-5). 
Several elements conform closely to drawings of the Spinosaurus fossils 
that were destroyed in World War II (Extended Data Fig. 6). 

More than 30 near-sequential caudal vertebrae (located within caudal 
positions 1-41) of FSAC-KK 11888 are preserved, and represent approxi- 
mately 80% of the original tail length (Extended Data Figs. 3, 4, Extended 
Data Tables 1,2). Both proximal and distal elements of the tail are complete 
and preserved in three dimensions, indicating minimal taphonomic distor- 
tion (Fig. 2, Supplementary Video 2). At the level of the caudal transition 
point!, the centra become proportionally more elongate. In addition, the 
prezygapophyses no longer overhang the preceding centrum and showa 
marked decrease in size compared to those of many theropod dinosaurs’. 
The postzygapophyses also decrease in size (Fig. 2), leading to a reduced 
contact with the prezygapophyses, and are absent inthe distalmost caudal 
vertebrae. This again is different from the condition seen in most thero- 
pods, inwhich zygapophyses become more elongate and more prominent 
towards the tail tip’, restricting flexibility in more distal intervertebral joints. 

The neural arches are distinctive elements of the Spinosaurus tail. 
Anotably complex array of vertebral laminae and fossae is present in 
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f, Sequential cross-sections through the neural spine of caudal vertebra 23 
(Ca23) to show apicobasal changes. g, Skeletal reconstruction. Scale bars, 
50cm (a-e), 10 cm (f),1m(g). 


the proximal caudal vertebrae, and partly persists in mid-caudal neu- 
ral arches. The morphology of the neural spines shows considerable 
variation along the sequence (Figs. 1, 2, Extended Data Table 1): the 
spines of the proximal caudal vertebrae are about three times taller than 
their centra and are cross-shaped in cross-section from their base to 
mid-height; in mid-caudal vertebrae, the spines become much longer; 
and inthe small distal caudal vertebrae, the length of the neural spines 
reaches well over seven times the height of the centrum, in contrast 
to the condition suggested in a previous study”. The neural spines of 
mid-distal caudal vertebrae of Spinosaurus have a cross-section that 
is unique among theropods: they are proximodistally—rather than 
transversely—flattened. This is owing to the hyper-development of 
the spinodiapophyseal laminae and the loss of pre- and postspinal 
laminae. The chevrons also differ from those of other theropods. The 
morphology of the chevrons in Spinosaurus varies little throughout the 
caudal series, except for a slight gradual reduction of the haemal canal: 
distal chevrons are as elongate as the proximal chevrons (Extended 
Data Table 2) but become slender, paralleling the gradual decrease in 
the size of the centra. Taken together, the elongate neural and haemal 
arches result in a tail shape that is markedly vertically expanded and 
has an extensive lateral surface area (Fig. 1, Extended Data Figs. 3, 4). 
The skeletal anatomy of Spinosaurus represents a major departure 
from that of other theropods—including from that of other members 
of the Tetanurae clade (which comprises crown group birds and all 
other stem theropods more closely related to birds than to Cerato- 
saurus'). One feature of the Tetanurae is a stiffened tail in which the 
degree of overlap in articulation between pre- and postzygapophy- 
ses increases along the caudal series, greatly diminishing the range 
of motion between individual vertebrae’. This trend toward reduced 
mobility is emphasized in paravians, with the appearance of ossified 
ligaments and/or reduction and fusion of the caudal vertebrae into a 
pygostyle”. By contrast, in Spinosaurus the pre- and postzygapophyses 
are much further reduced than in other tetanurans and—in the middle 
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Fig. 2 | Selected caudal vertebrae and chevrons of FSAC-KK 11888. 

a, Proximal caudal vertebra (Ca4) in left proximolateral view. b, c, Proximal 
chevron (Chv7) in left lateral (b) and proximal (c) view. d, Distal caudal vertebra 
(Ca31) in left lateral view. e, Mid-caudal vertebra (Ca12) in right proximolateral 
view. f, g, Distal chevron (Chv27) in left lateral (f) and proximal (g) views. 

h, Mid-caudal vertebra (Cal16) in right distolateral view. acdl, anterior 
centrodiapophyseal lamina; c, centrum; ca, caudal vertebra; cdf, 
centrodiapophyseal fossa; chva, chevron articulation; chvh, chevron head; 
cpol, centropostzygapophyseal lamina; cprl, centroprezygapophyseal lamina; 
hc, haemal canal; na, neural arch; ns, neural spine; pcdl, posterior 


and distal portions of the tail—not only do not overlap, but almost dis- 
appear (Fig. 2); this allows the caudal region considerable flexibility, 
especially with regard to lateral movements. 

We hypothesized that the highly specialized morphology of the 
Spinosaurus tail allowed it to function as a propulsive structure for 
aquatic locomotion. To test this idea, we evaluated the swimming 
potential of the Spinosaurus tail shape by comparing it to the tails of 
two terrestrial theropods (Coelophysis bauri and Allosaurus fragilis), 
two semi-aquatic tetrapods (the crocodile Crocodylus niloticus and 
the crested newt Triturus dobrogicus) and a rectangular control. 
Two-dimensional tail shapes were cut from 0.93-mm-thick plastic of flex- 
ural stiffness 5.8 x 10° Nm”. The plastic tails were attached to a robotic 
controller and actuated ina water flume to provide tail-tip amplitudes that 
were approximately 40% of tail length during swimming at 0.5 tail lengths 
persecond. Thisswimming speed andamplitude of motionis similar to that 
of slow aquatic locomotion in modern tetrapods” *. We measured swim- 
ming performance by quantifying the mean thrust and efficiency using 
asix-axis force—torque sensor attached to the shaft that drove each tail 
shape” (Fig.3, Methods, Supplementary Fig. 4, Supplementary Videos3-5). 

Our experimental results show that the Spinosaurus tail shape was 
capable of generating more than 8 times the thrust of the tail shapes 
of other theropods, and achieved 2.6 times the efficiency (Fig. 3, Sup- 
plementary Data 1). The greatest thrust was achieved by the tail shape 
of the crested newt (1.8 times that of Spinosaurus and 14.8 times that of 
Coelophysis), but the crocodile tail shape achieved greater propulsive 
efficiency (1.5timesthat of Spinosaurusand 4.0timesthat of Coelophysis), 
comparable to the rectangular control (Fig. 3). The lower efficiency 
recovered in this experiment for Spinosaurus (compared to the control 
withthe same surface area) and the crested newt indicates an effect of 
tail shape on performance. Overall, the vertically expanded tail shape of 
Spinosaurus imparts a substantial positive benefit to aquatic propulsion 
relative to the long and narrowtails of terrestrial theropods, supporting 
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centrodiapophyseal lamina; po, postzygapophysis; pocdf, postzygapophyseal 
centrodiapophyseal fossa; podl, postzygodiapophyseal lamina; posdf, 
postzygapophyseal spinodiapophyseal fossa; posl, postspinal lamina; pr, 
prezygapophysis; prcdf, prezygapophyseal centrodiapophyseal fossa; prdl, 
prezygodiapophyseal lamina; prsdf, prezygapophyseal spinodiapophyseal 
fossa; prsl, prespinal lamina; spdl, spinodiapophyseal lamina; spol, 
spinopostzygapophyseal lamina; spof, spinopostzygapophyseal fossa; sprl, 
spinoprezygapophyseal lamina; sprl-f, spinoprezygapophyseal lamina fossa; 
tp, transverse process. Scale bars, 10 cm (a),5cm (b-h). 


the inference that Spinosaurus used tail-propelled swimming. This tail 
morphology may have also increased the lateral stability of the body 
in the water, reducing the tendency to roll while floating”. 

Contrary to recent suggestions” that Spinosaurus was confined to 
wading and the apprehension of prey from around the edges of bod- 
ies of water, the morphology and function of its tail—along with its 
other adaptations for life in water’— point to Spinosaurus having been 
an active and highly specialized aquatic predator that pursued and 
caughtits prey in the water column (Extended Data Fig. 7). The skeletal 
remains of Spinosaurus (Supplementary Information) from the Kem 
Kem beds—composed of sediments deposited ina major fluvio-deltaic 
system’ that have yielded a diverse vertebrate assemblage”’—provide 
further insights into the ecology of this dinosaur. The composition of 
the ecosystem represented by the Kem Kem assemblage is highly atypi- 
cal, containing a rich freshwater fauna dominated by fishes (including 
lungfish and large-to-very-large sawfish and coelacanths”), a diverse 
range of crocodyliforms”® and several giant predatory dinosaurs’”. 
The seemingly anomalous occurrence in the same deposits of sev- 
eral large-bodied predators but few terrestrial herbivores is partially 
explained by the largely aquatic and probably piscivorous lifestyle 
of Spinosaurus, which considerably expands the morphological and 
ecological disparity of Kem Kem tetrapods”. At the same time, com- 
petition with several co-occurring large aquatic predators”®* may have 
driven the evolution of giant size in Spinosaurus. 

Although the unique postcranial adaptations of Spinosaurus point 
towards an entirely novel mode of locomotion within Dinosauria, other 
spinosaurids share a wide range of derived anatomical features that are 
consistent witha partially aquatic, piscivorous mode of life”*""*°. The 
exact extent to which an aquatic lifestyle was adopted by these taxa, 
and how this varied across Spinosauridae, remains to be established. 
However, the near-global distribution of spinosaurids (which have now 
been reported from Europe, Asia, Africa and South America®’) and their 
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Fig. 3 | Comparative tail swimming performance. a, Spinosaurus plastic tail 
shape attached toa robotic drive shaft in a water flume with water flowing at 
10cms_. With reference to the tail, positive x forces (F,) are generated 
proximally (or upstream), positive y forces (F,) in the right lateral direction, and 
positive z forces (F,) inthe ventral direction. b,c, Mean thrust (b) and mean 
efficiency (c) generated by tail shapes during robotically controlled swimming. 
Alltails are scaled to 20-cm length (Supplementary Fig. 4). The control tail was 
rectangular in shape with the same surface area as the scaled Spinosaurus tail 
(63 cm’). See Methods for experimental setup. Raw thrust and efficiency data, 
including mean ands.e.m., are provided in Supplementary Data1. Swimming 
motions of the Spinosaurus tail are visualized in Supplementary Videos 3-5. 


substantial temporal range (first appearing, based on phylogenetic 
inference, inthe Mid—or possibly even Early—Jurassic epoch and with 
a fossil record that spans more than 50 million years (from the Late 
Jurassic to the early Late Cretaceous epoch)") point toa persistent and 
widespread invasion of aquatic habitats by dinosaurs. 


Online content 


Any methods, additional references, Nature Research reporting sum- 
maries, source data, extended data, supplementary information, 


70 | Nature | Vol581 | 7 May 2020 


acknowledgements, peer review information; details of author con- 
tributions and competing interests; and statements of data and code 
availability are available at https://doi.org/10.1038/s41586-020-2190-3. 


1. Weishampel, D. B., Dodson, P. & Osmolska, H. The Dinosauria 2nd edn (Univ. of California 
Press, Berkeley, 2004). 

2. Owen, R. A description of a portion of the skeleton of the Cetiosaurus, a gigantic extinct 
saurian reptile occurring in the oolitic formations of different portions of England. Proc. 
Geol. Soc. Lond. 3, 457-462 (1841). 

3. Cope, E. On the characters of the skull in the Hadrosauridae. Proc. Acad. Nat. Sci. 
Philadelphia 35, 97-107 (1883). 

4. Kermack, K. A. A note on the habits of sauropods. Ann. Mag. Nat. Hist. 4, 830-832 (1951). 

5. Bakker, R. T. Ecology of the brontosaurs. Nature 229, 172-174 (1971). 

6. Alexander, R. M. Mechanics of posture and gait of some large dinosaurs. Zool. J. Linn. 
Soc. 83, 1-25 (1985). 

7. — Ibrahim, N. et al. Semiaquatic adaptations in a giant predatory dinosaur. Science 345, 
1613-1616 (2014). 

8.  Aureliano, T. et al. Semi-aquatic adaptations in a spinosaur from the Lower Cretaceous of 
Brazil. Cretac. Res. 90, 283-295 (2018). 

9. Evers, S. W., Rauhut, O. W. M., Milner, A. C., McFeeters, B. & Allain, R. A reappraisal of the 
morphology and systematic position of the theropod dinosaur Sigilmassasaurus from the 
“middle” Cretaceous of Morocco. PeerJ 3, e1323 (2015). 

10. Henderson, D. M. A buoyancy, balance and stability challenge to the hypothesis of a 
semi-aquatic Spinosaurus Stromer, 1915 (Dinosauria: Theropoda). PeerJ 6, e5409 
(2018). 

11. Hone, D. W. E. & Holtz, T. R. Jr. Comment on: Aquatic adaptation in the skull of carnivorous 
dinosaurs (Theropoda: Spinosauridae) and the evolution of aquatic habits in spinosaurids. 
93: 275-284. Cretac. Res. https://doi.org/10.1016/j.cretres.2019.05.010 (2019). 

12. Stromer, E. Ergebnisse der Forschungsreisen Prof. E. Stromers in den Wiisten Agyptens. 
ll. Wirbeltier-Reste der Baharije -Stufe (unterstes Cenoman). 3. Das Original des 
Theropoden Spinosaurus aegyptiacus nov. gen., nov. spec. Abh. Kgl. Bayer. Akad. Wiss. 
Math. Phys. Kl. Mtinchen 28, 1-28 (1915). 

13. Vullo, R. et al. Convergent evolution of jaws between spinosaurid dinosaurs and Pike 
Conger Eels. Acta Palaeontol. Pol. 61, 825-829 (2016). 

14. Arden, T.M.S., Klein, C. G., Zouhri, S. & Longrich, N. R. Aquatic adaptation in the skull of 
carnivorous dinosaurs (Theropoda: Spinosauridae) and the evolution of aquatic habits in 
Spinosaurus. Cretac. Res. 93, 275-284 (2019). 

15. Charig, A. J. & Milner, A. C. Baryonyx walkeri, a fish-eating dinosaur from the Wealden of 
Surrey. Bull. Nat. Hist. Mus. Geol. 53, 11-70 (1997). 

16. Sues, H. D., Frey, E., Martill, D. M. & Scott, D. M. Irritator challengeri, a spinosaurid 
(Dinosauria: Theropoda) from the Lower Cretaceous of Brazil. J. Vert. Pal. 22,535-547 
(2002). 

17. Witmer, L. M. in Mesozoic Birds: Above the Heads of Dinosaurs (eds Chiappe, L. M. & 
Witmer, L. M.) 3-30 (Univ. California Press, 2002). 

18. Tereschenko, V. Adaptive features of protoceratopoids (Ornithischia, Neoceratopsia). 
Paleontol. J. 42, 273-286 (2008). 

19. Cau, A. et al. Synchrotron scanning reveals amphibious ecomorphology in a new clade of 
bird-like dinosaurs. Nature 552, 395-399 (2017). 

20. Sereno, P. C. et al. A long-snouted predatory dinosaur from Africa and the evolution of 
spinosaurids. Science 282, 1298-1302 (1998). 

21. Lavocat, R. Sur les dinosauriens du Continental Intercalaire des Kem-Kem de la Daour. In 
Comptes rendus 19e Congrés Géologique International, Alger, 1952 65-68 (Academie des 
Sciences de Paris, 1954). 

22. Sereno, P. C. et al. Predatory dinosaurs from the Sahara and Late Cretaceous faunal 
differentiation. Science 272, 986-991 (1996). 

23. D’Aott, K. & Aerts, P. Kinematic and swimming efficiency of steady swimming in adult 
axolotls (Ambystoma mexicanum). J. Exp. Biol. 200, 1863-1871 (1997). 

24, Fish, F. Kinematics of undulatory swimming in the American alligator. Copeia 1984, 
839-843 (1984). 

25. Frolich, L.M. & Biewener, A. A. Kinematic and electromyographic analysis of the 
functional role of the body axis during terrestrial and aquatic locomotion in the 
salamander Ambystoma tigrinum. J. Exp. Biol. 162, 107-130 (1992). 

26. Lauder, G. V., Flammang, B. & Alben, S. Passive robotic models of propulsion by the 
bodies and caudal fins of fish. Integr. Comp. Biol. 52, 576-587 (2012). 

27. Cavin, L. et al. Vertebrate assemblages from the early Late Cretaceous of southeastern 
Morocco: an overview. J. Afr. Earth Sci. 57, 391-412 (2010). 

28. Meunier, L. M. V. & Larsson, H. C. E. Revision and phylogenetic affinities of Elosuchus 
(Crocodyliformes). Zool. J. Linn. Soc. 179, 169-200 (2017). 

29. Amiot, R. et al. Oxygen and carbon isotope compositions of middle Cretaceous 
vertebrates from North Africa and Brazil: ecological and environmental significance. 
Palaeogeogr. Palaeoclimatol. Palaeoecol. 297, 439-451 (2010). 

30. Hone, D. W. E. & Holtz, T. R. A century of spinosaurs. A review and revision of the 
Spinosauridae with comments on their ecology. Acta Geol. Sin. 91, 1120-1132 (2017). 


Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 


© The Author(s), under exclusive licence to Springer Nature Limited 2020 


Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Excavation and reconstruction 

The Cretaceous Kem Kem beds of Morocco crop out along an exten- 
sive escarpment, often located near the Moroccan-Algerian border’. 
After the accidental discovery and partial excavation by a local collec- 
tor in 2008, part of a single skeleton (FSAC-KK 11888)—subsequently 
deposited at the Faculté des Sciences of Casablanca University—was 
recovered, published and designated as the neotype of S. aegyptiacus’. 
Amulti-institutional collaborative project in the years 2015-2019, led by 
N.I., resulted in four joint expeditions to the neotype site. Detailed and 
careful exploration of the debris around the site, as well as asystematic 
and extended excavation of the unexposed portion of the fossiliferous 
layer of the Zrigat hill, led to the recovery of many additional elements 
of the neotype skeleton (Extended Data Figs. 1-5). A detailed descrip- 
tion of the newly recovered material, as well as the geological context, is 
included inthe Supplementary Information. The Supplementary Infor- 
mation also includes details of a full-body flesh reconstruction of Spi- 
nosaurus based on FSAC-KK 118838, as well as estimates of whole-body 
mass, segment masses, segment centres of mass and whole-body centre 
of mass (Supplementary Data 2). The position of the centre of mass in 
comparison to that in previously published analyses”° can be found 
in Extended Data Fig. 8. 


Osteohistological analysis 

The aim of the osteohistological analysis was to determine whether 
the remains that were assigned to FSAC-KK 11888 belong to a single 
individual, rather than a chimeric association of juvenile and adult 
individuals preserved in the same location and at the same horizon. The 
analysis was based on five skeletal elements. The primary assumption 
is that, should histological details suggest that all five elements repre- 
sent the same ontogenetic stage, then the remains are more likely to 
represent one individual than multiple individuals. By contrast, should 
these elements exhibit two or more distinct ontogenetic stages, this 
would point to the presence of multiple individuals of one taxon (or 
perhaps several taxa), all fortuitously preserved at a single location 
during a single depositional event” °°. 

The following elements were sectioned: the right femur; the left 
fibula; one rib; and two neural spines. All specimens were sectioned 
before preparation to ensure that no outer layers of the compact cortex 
were accidentally removed. In the case of the neural spines, the apical 
portion was sectioned. 

Thin sectioning followed standard protocols*™. The thin sections 
have a thickness of 50-70 xm, and were analysed with a petrographic 
microscope (Leica DM 2500 P). Digital images were captured using a 
ProgRes Cfscan camera. Only continuous lines were counted as lines 
of arrested growth. Annuli were interpreted as a single year, follow- 
ing a previous publication®. Retrocalculation, following a previously 
published method”, was applied to determine the likely number of 
missing lines of arrested growth, eroded through remodelling of the 
bone. In the case of the neural spines, only the width of the innermost 
zone was used to retrocalculate the missing lines of arrested growth, 
because the shape of the section could not be approximated to a cir- 
cular outline. The calculation of the major and minor axes used for the 
retrocalculation was performed in Image]”. Results of the histological 
analysis are included in the Supplementary Information. 


Experimental testing of tail-shape swimming performance 

To test the aquatic locomotor potential of the newly reconstructed 
S. aegyptiacus tail, we determined the swimming performance of its 
tail shape using a robotic controller developed for studies of propulsive 


hydrodynamics** ”, The swimming performance of the Spinosaurus tail 
shape was compared to the performance of five other tail shapes from 
the following species: the small-bodied terrestrial theropod C. bauri, 
the large-bodied terrestrial theropod A. fragilis, the semi-aquatic croc- 
odile C. niloticus, the semi-aquatic crested newt 7. dobrogicus anda 
rectangular control tail that was scaled to the same surface area as the 
Spinosaurus tail. Tail shapes (Supplementary Fig. 4) were all scaled to 
20 cm proximodistal length, manufactured from 0.93-mm-thick plas- 
tic of flexural stiffness 5.8 x 10° Nm? and cut using an Epilog Zing 24 
laser cutter. 

The plastic tails were attached to a robotic controller that allowed 
us to impose specific motion programmes on the rigid shaft to which 
each tail was affixed (Fig. 3, Supplementary Videos 3-5). This shaft was 
moved in both heave (side-to-side) motion, as well asin pitch (angular 
rotation), to achieve undulatory tail motions. The imposed motion 
programme was 1-Hz frequency, +1-cm heave and +25° pitch, which 
resulted in the tail tip undergoing peak-to-peak lateral excursions of 
approximately 40% of the proximodistal length, comparable to that 
exhibited by swimming axolotls and alligators”* >. 

The shaft supporting each simulated tail at the leading edge was 
attached to an ATI (Apex) Nano-17 six-axis force-torque sensor located 
just above the water surface. Testing occurred ina recirculating water 
flume, and a free-stream flow of 0.51 (10 cm s7) was imposed for all 
tests. Custom LabVIEW programs (National Instruments) were used 
to control flapping frequency, flow speed, heave and pitch. A custom 
LabVIEW program also was used to acquire data from the ATI trans- 
ducer at a sampling rate of 1,000 Hz. Each tail shape was tested n=5 
times (except for the Spinosaurus tail, which was tested n=5 times on 
2 different days for a total of n=10 tests). Output data can be found in 
Supplementary Data 1. 

Thrust and efficiency for each tail shape were calculated using stand- 
ard fluid dynamic equations as in previous research****, Mean thrust 
force (F,) is calculated directly from transducer output from the F,, 
channel, and we accounted for transducer rotation resulting from the 
pitch motion to provide the force component directed upstream 
(positive thrust). Propulsive efficiency is calculated as the ratio 
of the thrust coefficient (C;= 2Fx/pU’cs) to the power coefficient 
(C= 2P/pU°cs), inwhich pis the fluid density, Vis the swimming veloc- 
ity, cis the foil chord and s is the tail span. Effectively, this metric 
assesses the extent to which input power is translated into thrust. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1| Excavation of the FSAC-KK 11888 site. a-f, Different 2018. e, 5 December 2018. f, 21July 2019. g, h, Selected bones in situ. g, Largely 
stages of the excavation, which resulted in the removal of over 15 tons of rock complete proximal caudal vertebra (Ca4). h, Neural spine of a mid-distal caudal 
using a range of tools, including picks, brushes, hammers and a jackhammer. vertebra, fragmented by syndiagenetic cracks. Scale bars, 10cm. 

a, 17 November 2013. b, 29 March 2015. c, 17 September 2018. d, 19 September 
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Extended Data Fig. 2| Excavation of the caudal elements of FSAC-KK 11888. mid-caudal vertebrae. c, Two haemal arches. d, Close association of middle 
a, Largely complete distal caudal vertebra (Ca31), recovered inits entirety by caudal elements. Scale bars, 10cm. 
digging a tunnel until the apex of the neural spine was reached. b, Semi-articulated 


Extension 
of the debris 
landslide ~ 45 m 


Extension 
of the debris 
landslide 12,8 m 


North 


2007/2008 dig (original cave”) 


_ 2015-2019 expeditions 
Debris 


B - See Extended data Figure 1g 
C - See Extended data Figure 1h 
D - See Extended data Figure 2a 
E - See Extended data Figure 2b 
F - See Extended data Figure 2c 
G - See Extended data Figure 2d 


A - Onchopristis teeth 
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Extended Data Fig. 3| Excavation map and skeletal reconstruction. excavations during the 2015-2019 expeditions (green) and sieving in the debris 
Detailed map of the site of discovery of FSAC-KK 11888, and fully revised area during the 2015-2019 expeditions (yellow). Both images are at the same 
skeletal reconstruction. Colours inthe map and reconstruction correspond to scale. Scale bar, 1m. 

different phases of excavation: the local discovery in 2007-2008 (red), our 
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Extended Data Fig. 4| The caudal series of FSAC-KK 11888. Photograph of 
the entire caudal series (numbered). Scale bar, 1m. 
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Extended Data Fig. 5 | Elements of FSAC-KK 11888 from 2008 (first 
excavation) and 2019 (most recent excavation), matched. a-t, Evidence of 
the perfect match between elements collected by the local discoverer of the 
site in 2007-2008 (a, e, f, i, k, m,n, p, q,S) and elements excavated in situ or 
recovered from the site debris during the 2015-2019 excavations (b-d,g,h,j,I, 
o,r,t).a, b, Right and left metatarsal II. c, Left penultimate phalanx of the fourth 
pedal digit (IV-4) that came to light within the typical matrix in which bones of 
the FSAC-KK 11888 were embedded. d, e, Two possible splenial fragments 


tive. 


reconnected. f-i, Phalanx IV-4 prepared and compared to the contralateral 
element of the right pes in dorsal view, and rearticulated with its ungual. 

j-m, Twocomplementary (broken) halves of the left squamosal and ofa dorsal 
rib.n-p,s,t, Two key fragments from the debris, reconnecting the base and the 
shaft ofa neural spine (possibly the 7th). q, r, The right astragalus (excavated 

in situ inJuly 2019) rearticulated to its tibia (from 2008). Arrows point to 
recomposed fractures. 
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Extended Data Fig. 6 | Comparison of FSAC-KK 11888 caudal vertebrae to 
those destroyed in World War II. Comparison between the caudal vertebrae 
ofthe neotype of S. aegyptiacus, with those of the two specimens (the holotype 
and aspecimen knownas ‘Spinosaurus B’, both of which are now lost) from the 
Bahariya Oasis (Egypt) described by E. Stromer’. a-d, Proximal caudal 
vertebra of the holotype (accession code BSP 1912 VIII19) in distal (a) and right 


lateral (c) views; and of Ca4 of FSAC-KK 11888 in distal (b) and right lateral (d) 
views. e-j, Proximal caudal vertebra of SpinosaurusB in dorsal (e), right lateral 
(g) and proximal (i) views; Call of FSAC-KK 11888 in dorsal (f), right lateral (h) 
and proximal (j) views. k-p, Middle caudal vertebra of Spinosaurus B in dorsal 
(k), left lateral (m) and distal (0) views; Ca21 of FSAC-KK 11888 in dorsal (I), left 
lateral (n) and distal (p) views. Scale bars=10 cm. 


Extended Data Fig. 7 | Three-dimensional fleshed-out model based on FSAC-KK 11888. a, b, Symmetrical pose in five views (a) and swimming pose (b). 
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A 
(0, -1.2900633, 0) 


B 
(10.3659996, -0.8113801, 0) 


Extended Data Fig. 8 | Whole-body centre of mass. a—d, Snout tip and tail tip approaches (Supplementary Information, Supplementary Data 2) and 
in the coordinate system (a) and centre-of-mass distance from the cranial compared to the centre of mass in ref.” (c); and ref.’ (d). 
margin of the acetabulum in this study (b), calculated using multiple 


Extended Data Table 1| Measurements of caudal vertebrae of FSAC-KK 11888 


number 
in the series 


Cal 
Ca2 
Ca3 
Ca4 
CaS 
Ca6 
Ca7 
Cas 
Ca9 
Caio 


Cail 
Cai2 
Cal3 
Cai4 
Cais 
Cal6 
Cal7 
Cais 
Cai9 
Ca20 
Ca21 
Ca22 
Ca23 
Ca24 
Ca25 
Ca26 
Ca27 
Ca28 
Ca29 
Ca30 
Ca31 
Ca32 
Ca33 
Ca34 
Ca35 
Ca36 
Ca37 
Ca41 


proximodistal 
length of 


centrum 


78 
94 

75 (p) 
98 
n.p. 
n.p. 
114 
112 
99 

115 (p, L) 
106 (p, R) 

113 
122 

106 (e) 
117 
120 
116 
n.p. 

102 (p) 
115 


75 (p) 


dorsoventral 


height 


transverse 
diameter 


proximal face proximal face of 


of centrum 
n.p. 
129 
n.p. 
116 
n.p. 
n.p. 
102 
97 
99 (e) 
97 (e) 


91 
101 
79 (e) 
64 (p) 
84 
87 
n.p. 
n.p. 
84 
107 
78 
80 
78 
78 
74 
77 
92 
71 
70 
n.p. 
64 
60 
67 
60 
n.p. 
n.p. 
46 
39 


centrum 
n.p. 
113 
n.p. 
106 
n.p. 
n.p. 
113 
108 
81 (p) 
76 (p, deformed) 


86 
75 
59 (p) 
n.p. 
80 
78 
n.p. 
n.p. 
70 
71 
77 
74 
72 
67 
81 
76 
66 
65 
71 
n.p. 
58 
58 
51 
62 
n.p. 
n.p. 
43 
37 


minimum 
transverse diameter 
of centrum 
(constriction) 
n.p. 
67 
n.p. 
73 
n.p. 
n.p. 
74 
64 
63 
47 


42 
32 (p, compressed) 
36 (p, slightly collapsed) 
n.p. 
35 
29 
29 
31 
n.p. 
n.p. 
29 
21 


maximum length of 
neural spine (dorsal to 
the neural canal, along 
the curvature if any) 
92 (p) 
n.p. 
n.p. 
355 (p) 
140 (p) 
59 (p) 
335 (p) 
n.p. 
n.p. 
240 (p) 


180 (p) 
250 (p) 
n.p. 
n.p. 
375 (p) 
285 (p) 
210 (p) 
145 (p) 
445 (p) 
405 (p) 
525 (p) 
285 (p) 
540 
535 (p) 
550 (p) 
480 (p) 
n.p. 
n.p. 
290 (p) 
295 (p) 
535 
325 (p) 
505 (p) 
290 (p) 
n.p. 
n.p. 
455 
384 


Measurements are in mm. (p), not complete, measured as preserved; n.p., not preserved; n.a., not applicable; e, estimated. 


maximum width at 
transverse processes 


320 (p, based on Ltp) 
n.p. 
n.p. 
480 (p, based on Ltp) 
n.p. 
n.p. 
n.p. 
n.p. 
n.p. 
n.p. 


200 (p, almost complete) 
210 (based on R) 
n.p. 

n.p. 

186 (p, based on L) 
106 (p, based on L) 
n.p. 

n.p. 

n.p. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 

n.a. 
na. 

n.a. 

n.a. 

n.a. 
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Extended Data Table 2 | Measurements of chevrons of FSAC-KK 11888 


number maximum height height (measured base transverse height haemal canal 
in the perpendicular to the width 
series proximodistal axis 
of the tail) 
Chv3 52. n.p. n.p. n.p. 
Chv5 91 n.p. n.p. n.p. 
Chv6 195 (p) 170 (p) 72 74 
Chv7 215 185 80 72 
Chv8 204 160 73 73 
Chv9 150 (p) 130 (p) n.p. n.p. 
Chv11 195 170 81 73 
Chv12 155 (p) 143 (p) n.p. n.p. 
Chvi9 160 (p) 125 (p) 72 64 
Chv22 103 (p) 90 (p) n.p. n.p. 
Chv24 224 205 65 65 
Chv25 44 (p) n.p. n.p. n.p. 
Chv27 210 185 63 55 
Chv28 155 (p) n.p. n.p. n.p. 
Chv29 n.p. n.p. n.p. n.p. 
Chv31 144 (p) 135 (p) 62 60 
Chv32 176 (p) 160 53 53 
180 (e) 
Chv33 140 (p) n.p. n.p. n.p. 
Chv34 33 (p) | n.p. | n.p. | n.p. 


Measurements are in mm. (p), not complete, measured as preserved; n.p., not preserved; e, estimated. 


Nizar Ibrahim 
j ld TC [¢ S¢ arc Corresponding author(s): (reference number: 2019-06-09085E) 


Last updated by author(s): Jan 30, 2020 


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Custom LabVIEW programs (National Instruments Corp., Austin, TX, USA) for experimental testing of swimming performance. 


Data analysis Blender 2.79, ZBrush 4r7, and ClayTools 3D 2.0 for model rendering and sculpting; FlashPrint 3.26.0 to cut 3D model; Meshlab 2016.12 
and Python 3.7 (through platform Anaconda) for calculation of the Centre of Mass; Meshroom 2019.1.0 for photogrammetry; custom 
LabVIEW 2013 programs (National Instruments Corp., Austin, TX, USA) for experimental testing of swimming performance; ImageJ 1.51w 
for retrocalculation of missing Lines of arrested Growth (histology). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


The authors declare that all data supporting the findings of this study are available within the paper and its Supplementary Information files (Source data). 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat. pdf 


Ecological, evolutionary & environmental sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description We present the first unambiguous evidence for an aquatic propulsive structure in a dinosaur, the giant theropod Spinosaurus 
aegyptiacus. This dinosaur has a tail with an unexpected and unique shape consisting of extremely tall neural spines, and elongate 
chevrons forming a large, flexible, fin-like organ capable of extensive lateral excursion. Using a mechanical flapping apparatus to 
measure undulatory forces in physical tail models, we show that the tail shape of Spinosaurus produces far greater thrust and 
efficiency than the tail shapes of terrestrial dinosaurs, comparable to that of extant aquatic vertebrates that use vertically expanded 
tails to generate forward propulsion while swimming. This conclusion is consistent with a suite of adaptations for an aquatic lifestyle 
and a piscivorous diet in Spinosaurus, and with a persistent and significant invasion of aquatic environments by spinosaurid 
dinosaurs. 
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Research sample A nearly complete, partially articulated fossil tail of a subadult individual of Spinosaurus aegyptiacus (FSAC-KK 11888), from the 
Cretaceous Kem Kem beds of south- eastern Morocco. Plastic tail shapes from the following species: Spinosaurus aegyptiacus, 
Coelophysis bauri, Allosaurus fragilis, Crocodylus niloticus, Triturus dobrogicus, and a rectangular control tail scaled to the same 
surface area as the Spinosaurus tail. 


Sampling strategy Systematic collection of a partial dinosaur skeleton (one individual). Tail shape experiments: Coelophysis and Allosaurus are well 
known taxa and reflect a basal theropod and large-bodied theropod respectively. The two extant aquatic tetrapods (salamander and 
crocodile) were chosen as they are known for utilizing tail-propelled swimming. 


Data collection N.I. led the expeditions and the project. N.I., S.M., C.D.S., M.F., M.A., D.M.M., G.B., S.Z. D.M. and A.A. collected the specimens in the 
field. N.I., S.M., C.D.S., M.F., J.W., G.V.L. and S.E.P. designed the research. N.I., S.M., C.D.S., M.F., J.W., G.V.L. and S.E.P. designed and 
performed the experiments. N.I., S.M., C.D.S., M.F., M.A., D.M.M., J.W., G.B., S.Z.D.M., D.M.U., U.J., J.J., A.A., G.V.L., and S.E.P. 
analysed the data. 


Timing and spatial scale 2018-2019; the specimen was collected from the Kem Kem beds, which crop out along the Moroccan-Algerian border for over 250 
km. The fossil was collected from a well-defined field site (site map included in our submission) located near the town of Zrigat. 


Data exclusions No data were excluded 

Reproducibility The experiments were repeated over multiple days to ensure consistent measures. All repeats were successful. 
Randomization We did not use quantitative approaches that would require randomization. 

Blinding Our study (field: palaeontology) does not include experiments that would require blinding. 

Did the study involve field work? Yes No 


Field work, collection and transport 


Field conditions Desert escarpment close to Moroccan Algerian border 


Location In- and ex-situ on the slopes of a south-east facing escarpment fringing the Aferdou Zrigat plateau (Tafilalt basin, Akrabou 
Formation, Kem Kem beds). See Supplemental Information. 


Access and import/export Permits for fieldwork were obtained from Ministére de l'Energie, des Mines, et de l'Environnement. Permits: 4581/DE/2019 
(issued on 17/07/2019) and 4118/DE/2018/DG (issued on 06.06.2018). The work was performed in close collaboration with 
researchers in Morocco (FSAC, Casablanca). The specimens collected are deposited at the Departement de Géologie/Laboratoire 
de Biodiversité et Santé, Faculté des Sciences Ain Chock, Hassan II University, Casablanca, Morocco. 


Disturbance No disturbance 
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Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 


n/a | Involved in the study n/a | Involved in the study 
[| Antibodies [| ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Palaeontology 


Specimen provenance Permits for fieldwork were obtained from Ministére de I'Energie, des Mines, et de l'Environnement. Permits: 4581/DE/2019 
issued on 17/07/2019) and 4118/DE/2018/DG (issued on 06.06.2018). The work was performed in close collaboration with 
researchers in Morocco (FSAC, Casablanca). The specimens collected are deposited at the Departement de Géologie/Laboratoire 
de Biodiversité et Santé, Faculté des Sciences Ain Chock, Hassan II University, Casablanca, Morocco. 
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Specimen deposition Laboratoire de Biodiversité et Santé, Faculté des Sciences Ain Chock, Hassan II University, Casablanca, Morocco 


Dating methods No new dates are provided 


Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information. 
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® Check for updates 


Vascular contributions to dementia and Alzheimer’s disease are increasingly 
recognized’ ®. Recent studies have suggested that breakdown of the blood-brain 
barrier (BBB) is an early biomarker of human cognitive dysfunction’, including the 
early clinical stages of Alzheimer’s disease**°. The E4 variant of apolipoprotein E 
(APOEA4), the main susceptibility gene for Alzheimer’s disease” “, leads to accelerated 
breakdown of the BBB and degeneration of brain capillary pericytes® ”, which 
maintain BBB integrity” ”. It is unclear, however, whether the cerebrovascular effects 
of APOE4 contribute to cognitive impairment. Here we show that individuals bearing 
APOE4 (with the €3/e4 or €4/¢4 alleles) are distinguished from those without APOE4 
(¢3/e3) by breakdown of the BBB in the hippocampus and medial temporal lobe. This 
finding is apparent in cognitively unimpaired APOE4 carriers and more severe in those 
with cognitive impairment, but is not related to amyloid-f or tau pathology measured 
in cerebrospinal fluid or by positron emission tomography”. High baseline levels of 
the BBB pericyte injury biomarker soluble PDGFRB’* in the cerebrospinal fluid 
predicted future cognitive decline in APOF4 carriers but not in non-carriers, even after 
controlling for amyloid-B and tau status, and were correlated with increased activity 
of the BBB-degrading cyclophilin A-matrix metalloproteinase-9 pathway” in 
cerebrospinal fluid. Our findings suggest that breakdown of the BBB contributes to 
APOE4-associated cognitive decline independently of Alzheimer’s disease pathology, 
and might bea therapeutic target in APOF4 carriers. 


Analysis of BBB permeability by dynamic contrast-enhanced magnetic 
resonance imaging (DCE-MRI)’® (Fig. 1a; see Methods) in 245 partici- 
pants (Extended Data Table 1) indicated that there was increased BBB 
breakdowninthe hippocampus (HC) and parahippocampal gyrus (PHG) 
in cognitively normal APOF4 (¢3/e4 and €4/e4) carriers, compared to 
cognitively normal APOE3 homozygotes (€3/e3), both with clinical 
dementia rating (CDR) scores of 0. The BBB breakdown in the HC and 
PHG in APOF4 carriers increased further with cognitive impairment 
at a CDR score of 0.5 (Fig. lb-d). This increase was independent of 
differences in amyloid-B (AB) and phosphorylated tau (pTau) in the 


cerebrospinal fluid (CSF) (Fig. le-h); that is, whether individuals were 
AB+ or AB- and pTau+ or pTau- using the accepted cut-off values’””*> 
(see Methods), where AB+ and pTaut+ status indicates classical Alzhei- 
mer’s disease (AD)-associated pathways”. By contrast, APOE3 carriers 
with cognitive impairment developed less pronounced BBB changes in 
the HC and PHG (Fig. Ib-d). We found no significant BBB differences 
in other grey or white matter brain regions between APOF4 carriers 
and APOE3 homozygotes, except for increased BBB permeability in 
the caudate nucleus and minor leaks in the frontal cortex and corpus 
callosum in cognitively normal APOE4 carriers (Extended Data Fig. 1). 
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Knight Alzheimer’s Disease Research Center, Washington University School of Medicine, St Louis, MO, USA. "Department of Psychiatry and Behavioral Sciences, University of Southern 
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Fig. 1| BBB breakdown in the HC and PHG in APOF4 carriers increases with 
cognitive impairment, independently of CSF Af and tau status. a, b, Maps 

of BBB permeability transfer constant (K,,,,,) generated by DCE-MRI (a) inthe 
HC of APOE3 homozygotes (APOE3) and APOF4 carriers (APOF4) with CDR 
scores of 0 or 0.5 (b). FA, flip angle; Tlw, Tl weighted. c, d, BBB K,,a,; in the HC (c) 
and PHG (d) inindividuals with CDR O bearing APOE3 (black, n=128) or APOEF4 
(red, n= 68) and with CDR 0.5 bearing APOE3 (black, n=14) or APOF4 (red, 
n=25).e, f, Kyran; in the HC (e) and PHG (f) in APOF4 carriers with CDR O who were 
AB,_42 negative (AB-; n = 37) or positive (AB+; n=16), or with CDR 0.5 who were 
AB-(n=7) or AB+(n=10).g,h, K;,-ans in the HC (g) and PHG (h) in APOF4 carriers 
with CDR O who were pTau- (n= 42) or pTau+t (n=10), and with CDR 0.5 who were 
pTau- (n=13) or pTau+ (n=5S).i, HC (blue) and PHG (orange) overlaid ona3D 


These findings held when cognitive dysfunction was evaluated by neu- 
ropsychological performance (see Methods) (Extended Data Figs. 2, 3). 

The volumes of the HC and PHG decreased with cognitive impairment 
in APOE4 but not APOE3 carriers (Fig. li-k). The breakdown of the BBB 
inthe HC and PHG in APOE4 carriers, but not APOE3 homozygotes, 
remained a highly significant predictor of cognitive impairment after 
we statistically controlled for age, sex, education, CSF AB and pTau 
status, and HC and PHG volumes, as shown by the estimated marginal 
means from the ANCOVA models (Fig. 1l, m), and confirmed by logis- 
tic regression models (Supplementary Table 1). The BBB dysfunction 
(Fig. 1c, d,1,m) preceded brain atrophy (Fig. 1j, k) and was independent 
of systemic vascular risk factors (Extended Data Fig. 4). 

Because both Af and tau can lead to blood vessel abnormalities and 
BBB breakdown?”°”’, we studied whether BBB disruption in APOEF4 car- 
riers was downstream from amyloid and tau accumulation ina subset 
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template.j, k, Volumes of the HC (j) and PHG (k) in individuals with CDRO 
bearing APOE3 (n=124) or APOF4 (n=75) and with CDR 0.5 bearing APOF3 
(n=13) or APOF4 (n=20).1, m, K,,ans (estimated marginal means +s.e.m. from 
ANCOVA models corrected for age, sex, education, CSF AB,_,, and pTau status, 
and HC and PHG volumes) in the HC (I) and PHG (m) in individuals with CDRO 
bearing APOE3 (black, HC n=125; PHGn=128) or APOF4 (red, HC and PHG 
n=68) and with CDR 0.5 bearing APOE3 (black, HC n=12; PHGn=14) or APOF4 
(red, HCn=20;PHGn=25).c-h,j,k, Continuous line, median; dotted line, 
interquartile range (IQR). Significance by ANCOVA for main effects and post 
hoc comparisons controlling for age, sex, and education. All ANCOVA omnibus 
tests remained significant at false discovery rate (FDR) threshold of 0.05. 


of 74 and 96 participants, respectively (Extended Data Tables 2a, b). 
Voxel-based analysis by positron emission tomography (PET) indicated 
a substantially higher accumulation of amyloid in the orbital frontal 
cortex (OFC) in cognitively normal APOF4 carriers compared to APOE3 
homozygotes, as reported’’, but did not detect accumulation of tau 
tracer in either APOF4 or APOE3 carriers (Extended Data Fig. 5a—d). To 
determine how BBB permeability relates to accumulation of amyloid 
and tau, we selected 5-mm-thick coronal slices in regions of interest 
that included the HC and PHG (where BBB disruption is seen first in 
APOF4 carriers compared to APOE3 homozygotes (Fig. 1b,d,e)), the 
OFC (where amyloid accumulation develops initially in APOE4 carriers), 
and the inferior temporal gyrus (ITG; a region that is affected early by 
tau pathology”’) (Extended Data Fig. 5b, d, e). 

Brain uptake of amyloid and tau tracers (after correction for the 
choroid plexus off-target binding for tau tracer; see Methods and 
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Fig. 2| Blood-brain barrier breakdown in APOE4 carriers is independent of 
amyloid and tau accumulation in the brain. All studies were performed in 
individuals with CDR score 0. a, Representative superimposed left HC amyloid 
PET (top), tau PET (middle), and BBB K,,.,, maps (bottom) from APOE3 (left) and 
APOEF4 (right) carriers. b,c, Amyloid and tau tracer uptake (b) and BBB K;,an; (€) 
in HC of APOE3 (n=45, 60, and 65) and APOF4 (n= 29, 37, and 31) carriers. 

d, Representative superimposed left PHG amyloid PET (top), tau PET (middle), 
and BBBK,,,,,, maps (bottom) from APOE3 (left) and APOF4 (right) carriers. 

e, f, Amyloid and tau tracer uptake (e) and BBB K,,a,s (f) in PHG of APOE3 (n=45, 
60, and 65) and APOF4 (n= 29, 37, and 31) carriers. g, Representative 
superimposed left medial OFC amyloid PET (top) and BBB K,,,, maps (bottom) 


Extended Data Fig. 5f, g) indicated no difference between APOF4 and 
APOE3 carriers inthe HC, although uptake of both tracers was modestly 
increased compared to the background values in cerebellum (Fig. 2a, b). 
The BBB was disrupted in the HCin APOE4 carriers compared to APOF3 
homozygotes (Fig. 2c), consistent with our findings in the larger cohort 
(Fig. 1b, c). There was no difference in amyloid and tau accumulation 
inthe PHG between APOEF4 carriers and APOE3 homozygotes, despite 
BBB disruption in APOF4 carriers (Fig. 2d—-f). Amyloid accumulation 
in the OFC was higher in cognitively normal APOF4 carriers than in 
APOE3 carriers (Fig. 2g, h), but there was no difference in BBB integrity 
(Fig. 2g, i). Inthe ITG, there were no differences in tau accumulation or 
BBB integrity between APOF4 and APOE3 carriers (Fig. 2j-1). Together, 


from APOE3 (left) and APOF4 (right) carriers. h, i, Amyloid tracer uptake (h) and 
BBB Kyran; (i) in OFC of APOE3 (n= 45 and 44) and APOF4 (n= 29 and 23) carriers. 
j, Representative superimposed left ITG tau PET (top) and BBB K,,.4,, Maps 
(bottom) from APOE3 (left) and APOF4 (right) carriers. k, I, Tau tracer uptake (I) 
and BBB K,,an; (I) in ITG of APOE3 (n= 60 and 59) and APOF4 (n= 37 and 28) 
carriers. b,c, e,f,h,i,k,1, Continuous lines, median; dotted lines, IQR. BBB Kyran 
was determined in all participants (see Extended Data Tables 2a, b) who 
received either both amyloid and tau tracers (n= 58), only amyloid tracer (n=9) 
or only tau tracer (n=29). Significance by ANCOVA for group comparisons 
controlling for age, sex, and education, and two-tailed t-tests for comparison of 
PET values to standardized uptake value ratios (SUVR) =1. 


these data suggest that BBB disruption in the HC and PHG in APOF4 
carriers is independent of AD pathology, and that BBB breakdown in 
APOF4 carriers starts in the medial temporal lobe, whichis responsible 
for memory encoding and other cognitive functions. 

In humans with AD and animal models, elevated levels of soluble 
platelet-derived growth factor receptor-f (SPDGFR#) in the CSF indi- 
cate that pericyte injury is linked to BBB breakdown’*” and cogni- 
tive dysfunction’*°. Using a median split for visual display of the CSF 
sPDGFRf baseline levels from 350 participants (see Methods), we 
stratified all participants into two groups, with low CSF sPDGFRf® levels 
(0-600 ng mI") and high sPDGFR levels (600-2,000 ng mI’) (Fig. 3a). 
In 146 APOEF4 carriers and APOF3 homozygotes who were evaluated 
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Fig. 3 | Elevated baseline CSF levels of sPDGFRB predict cognitive declinein 
APOE4 carriers. a, Histogram frequency distribution of CSF sPDGFR6 using 
median split to divide participants into two groups: high (blue, above median 
600-2,000 ng mI") and low (grey, below median 0-600 ng mI) baseline CSF 
sPDGFRB. All longitudinal analyses used baseline CSF sPDGFR# as acontinuous 
predictor of future cognitive decline. b,c, Linear mixed model analysis of all 
studied participants (n=146) followed at 2-year intervals for up to 4.5 years 
after baseline lumbar puncture (LP). Higher baseline CSF sSPDGFR (blue) 
predicts greater decline in demographically corrected mental status exam 
scores over time (P=0.01) (this remains significant after controlling (contr.) for 
CSF AB (P= 0.002) and pTau (P= 0.002) status; b), and in global cognitive 
composite scores (P= 0.01) (this remains significant after controlling for CSF 
AB (P=0.017) and pTau (P=0.01) status; c).d, e, Higher CSF sPDGFR® (blue) in 


by cognitive exams at 2-year intervals up to 4.5 years from baseline, 
participants with higher baseline CSF sPDGFR® exhibited accelerated 
cognitive decline ona global mental status exam and global cognitive 
composite z-scores, which remained significant after controlling for 
CSF AB and tau status (Fig. 3b, c; Supplementary Table 2). When strati- 
fied by APOE status, higher baseline CSF sPDGFR® levels in APOF4 car- 
riers predicted cognitive decline after controlling for CSF AB and pTau 
status (Fig. 3d, e; Supplementary Table 3), but did not predict decline 
in APOE3 homozygotes (Fig. 3f, g; Supplementary Table 4). 

The increase in CSF sPDGFRB with cognitive impairment was also 
found on cross-sectional CDR analysis in APOF4 carriers but not 
APOE3 homozygotes (Fig. 4a, b; Extended Data Table 3; Supplemen- 
tary Table 5). Increased levels of SPDGFR® in the CSF of APOF4 carri- 
ers correlated with increases in BBB permeability in the HC and PHG 
(Fig. 4c, d), and elevated levels of molecular biomarkers of BBB break- 
down including albumin CSF/plasma quotient, and CSF fibrinogen and 
plasminogen (Fig. 4e-g). 

Next, we focused on the proinflammatory cyclophilin A-matrix 
metalloproteinase-9 (CypA-MMP%9) pathway. When activated by brain 
capillary pericytes in APOF4 (but not APOE3) knock-in mice, this path- 
way leads to MMP9-mediated breakdown of the BBB, which in turn 
induces neuronal stress related to leaked blood-derived neurotoxic pro- 
teins followed by neuronal dysfunction and loss of synaptic proteins”. 
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APOE4 carriers (n= 58) predicts future decline in mental status exam scores 
(P=0.005) after controlling for CSF AB (P= 0.004) and pTau (P= 0.003) status 
(d), andin global cognitive composite scores (P= 0.02) after controlling for CSF 
AB (P=0.02) and pTau (P= 0.01) status (e). f, g, Baseline CSF sPDGFRB does not 
predict decline (n=88) in either mental status (f) or global composite (g) 
scores in APOE3 homozygotes regardless of CSF Af or pTau status. 

b-g, Separate lines indicate median split of baseline CSF sPDGFRB (grey, below 
median; blue, above median). A slopes provided for median split of baseline 
CSF sPDGFR® groups. tO =—1to 0.5 years post-LP, tl= 0.5 to 2.5 years post-LP, 
t2=2.5to4.5 years post-LP. Error bars shows.e. of the estimate. Linear mixed 
model analysis with no multiple comparison. See Supplementary Tables 2-4 for 
detailed statistics. 


Brain tissue analysis has also shown higher activation of the CypA- 
MMP9% pathway in degenerating brain capillary pericytes in APOE4 
carriers than in APOE3 homozygotes”. In our cohort, APOF4 carriers, 
but not APOE3 homozygotes, developed an increase in CypA CSF levels 
with cognitive impairment (Fig. 4h, i), which correlated with elevated 
CSF sPDGFR® (Fig. 4j). APOE4 carriers, but not APOE3 homozygotes, 
also developed elevated MMP9 in the CSF with cognitive impairment 
(Fig. 4k), which correlated with elevated CSF CypA levels (Fig. 41), sug- 
gesting that activation of the CypA-MMP9 pathway in APOE4 carriers 
correlates with pericyte injury, as shown in animal models”. There 
were no differences in glia or ininflammatory or endothelial cell injury 
CSF biomarkers between cognitively impaired and unimpaired APOE4 
and APOE3 participants, but there was an increase in neuron-specific 
enolase (NSE) with cognitive impairment in APOF4 carriers, confirming 
neuronal stress (Extended Data Fig. 6) and consistent with atrophy of 
the HC and PHG (Fig. lj, k). 

Studies in APOE knock-in mice and mouse pericytes have shown that 
apoE3, but not apoEF4, transcriptionally inhibits CypA via low-density 
lipoprotein receptor-related protein 1, whichinturntranscriptionally 
inhibits MMP9”. Consistent with the mouse data, pericytes derived 
from APOF4 (¢4/e4) human induced pluripotent stem cells (iPSCs) 
had substantially higher levels of CypA and secreted MMP9 than 
those derived from APOE3 (€3/e3) cells (Fig. 4m, n), suggesting that 
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Fig. 4 | Elevated CSF sPDGFRB, cyclophilin A and matrix metalloproteinase-9 
in APOE4 carriers. a, CSF sPDGFR® in individuals with CDR O bearing APOE3 
(black, n=152) or APOF4 (red, n=95) and with CDR 0.5 bearing APOE3 (black, 
n=42) or APOF4 (red, n=45).b, CSF sPDGFRf (estimated marginal 

means +s.e.m. from ANCOVA models corrected for age, sex, education, CSF 
AB,_4. and pTau status) in individuals with CDR O bearing APOE3 (black, n=152) 
or APOF4 (red, n=95) and with CDR 0.5 bearing APOE3 (black, n= 42) or APOF4 
(red, n=45).c,d, Correlation between CSF sPDGFRB and BBB K,;, an; in the HC 
(n=65;c) and PHG (n= 65; d) inAPOF4 carriers. e-g, Correlations between CSF 
sPDGFRB and albumin quotient (Q,,,,n = 92; e), fibrinogen (n= 93; f), and 
plasminogen (n=57; g) in APOEF4 carriers. h, CSF CypA in individuals with CDRO 
bearing APOE3 (black, n=75) or APOF4 (red, n= 62) and with CDR 0.5 bearing 
APOE3 (black, n=33) or APOF4 (red, n=45).i, CSF CypA (estimated marginal 
means +s.e.m. asin b) in individuals with CDR O bearing APOE3 (black, n=75) or 


apoE may control the CypA-MMP9 pathway in human pericytes in an 
isoform-specific manner, as in mouse models”. 

In APOEF4 carriers, CSF AB,_4, was reduced and CSF pTau levels were 
increased with cognitive impairment, compared to APOE3 homozy- 
gotes (Extended Data Fig. 7), as reported”; this difference remained 
significant after controlling for CSF sSPDGFR® levels (Extended Data 
Fig. 7). Together, these findings support the idea that the AB and tau 
pathways operate independently of the BBB breakdown pathway during 
the early stages of cognitive impairment in APOF4 carriers. 


APOEF4 (red, n= 62) and with CDR 0.5 bearing APOE3 (black, n=33) or APOE4 
(red, n=45).j, Correlation between CSF CypA and sPDGFRP in APOF4 carriers 
(n=96).k, CSF matrix metalloproteinase-9 (MMP%9) in individuals with CDRO 
bearing APOE3 (black, n=72) or APOF4 (red, n= 68) and with CDR 0.5 bearing 
APOE3 (black, n=33) or APOF4 (red, n=45).I, Correlation between CSF MMP9 
and CypAinAPOF4 carriers (n=104). m,n, CypA (m; see Extended Data Fig. 8) 
and secreted MMP9 in the culture medium (n) inhuman iPSC-derived APOE3 
(€3/e3) and APOF4 (€4/e4) pericytes. Mean + s.e.m. from four independent 
culture replicates. a, h,k, Continuous lines, median; dotted lines, IQR. 
Significance by ANCOVA for main effects and post hoc comparisons 
controlling for age, sex, and education. c-g,j, 1, Two-tailed simple linear 
regression; Pearson correlation coefficient (r).m,n, Unpaired two-tailed 
Student’s ¢-test. 


In summary, we have shown that BBB breakdown contributes to 
cognitive decline in APOF4 carriers independent of AD pathology; 
that high baseline CSF levels of sSPDGFR# can predict future cognitive 
decline in APOF4 carriers; and that APOF4, but not APOE3, activates the 
CypA-MMP%9 pathway in the CSF, which may lead to accelerated BBB 
breakdown and thereby cause neuronal and synaptic dysfunction”. As 
blockade of the CypA-MMP9 pathway in APOF4 knock-in mice restores 
BBB integrity and subsequently normalizes neuronal and synaptic func- 
tion”, it is possible that CypA inhibitors (some of which have been used 
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in humans for non-neurological applications”) might also suppress 
the CypA pathway in cerebral blood vessels in APOF4 carriers. This 
should improve cerebrovascular integrity, and reduce the associated 
neuronal and synaptic deficits, thereby slowing cognitive impairment. 
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Methods 


Study participants 

Participants were recruited from three sites: the University of Southern 
California (USC), Los Angeles, CA; Washington University (WashU), 
St. Louis, MO; and Banner Alzheimer’s Institute Phoenix, AZ and Mayo 
Clinic Arizona, Scottsdale, AZ as a single site. At the USC site, partici- 
pants were recruited through the USC Alzheimer’s Disease Research 
Center (ADRC): combined USC and the Huntington Medical Research 
Institutes (HMRI), Pasadena, CA. At the WashU site, participants were 
recruited through the Washington University Knight ADRC. At Ban- 
ner Alzheimer’s Institute and Mayo Clinic Arizona site, participants 
were recruited through the Arizona Apolipoprotein E (APOE) cohort. 
The study and procedures were approved by the Institutional Review 
Boards of USC ADRC, Washington University Knight ADRC, and Ban- 
ner Good Samaritan Medical Center and Mayo Clinic Scottsdale, indi- 
cating compliance with all ethical regulations. Informed consent was 
obtained from all participants before study enrolment. All participants 
(n=435) underwent neurological and neuropsychological evaluations 
performed using the Uniform Data Set (UDS)” and additional neu- 
ropsychological tests, as described below, and received a venipuncture 
for collection of blood for biomarker studies. An LP was performed in 
350 participants (81%) for collection of CSF. DCE-MRI for assessment 
of BBB permeability was performed in 245 participants (56%) who had 
nocontraindications for contrast injection. Both LP and DCE-MRI were 
conducted in 172 participants. Among the 245 DCE-MRI participants, 
74 and 96 were additionally studied for brain uptake of amyloid and 
tau PET radiotracers, respectively, as described below. No statistical 
methods were used to predetermine sample size. All biomarker assays, 
MRI, and PET scans were analysed by investigators blinded to the clini- 
cal status of the participants. 


Participant inclusion and exclusion criteria 

Included participants (>45 years of age) were confirmed by clinical and 
cognitive assessments to be either cognitively normal or at the earliest 
symptomatic stage of AD. A current or prior history of any neurologi- 
cal or psychiatric conditions that might confound cognitive assess- 
ment, including organ failure, brain tumours, epilepsy, hydrocephalus, 
schizophrenia, and major depression, was exclusionary. Participants 
were stratified by APOE genotype as APOF4 carriers (€3/e4 and €4/e4) or 
APOE4 non-carriers (€3/€3), also defined as APOE3 homozygotes, who 
were cognitively normal or had mild cognitive dysfunction, as deter- 
mined by CDR scores* and the presence of cognitive impairment in one 
or more cognitive domains based on comprehensive neuropsychologi- 
cal evaluation, including performance on ten neuropsychological tests 
assessing memory, attention/executive function, language and global 
cognition. For all analyses individuals with ¢3/e4 and €4/e4 alleles were 
pooled together in a single APOE4 group, as we did not find in the pre- 
sent cohort (82-86% €3/e4 and 14-18% ¢4/e4 participants, depending 
onthe outcome measure) a significant difference between individuals 
with two versus one €4 allele for the studied parameters, including the 
BBB K,,ans and SPDGFRP CSF values (see statistical section below). Indi- 
viduals were additionally stratified by AB and pTau CSF analysis as either 
AB,-4+ (<190 pg/ml) or AB,_4.- (190 pg/ml), and pTau+ (>78 pg/ml) 
or pTau- (<78 pg/ml), using accepted cutoff values””*”*. 

Participants were excluded if they were diagnosed with vascular cog- 
nitive impairment or vascular dementia. Clinical diagnoses were made 
by neurologists and criteria included whether the patient had a known 
vascular brain injury, and whether the clinician judged that the vascular 
braininjury played arolein their cognitive impairment, and/or pattern 
and course of symptoms. In addition to clinical diagnosis, the presence 
of vascular lesions was confirmed by moderate-to-severe white matter 
changes and lacunar infarcts by fluid-attenuated inversion recovery 
(FLAIR) MRI and/or subcortical microbleeds by T2*-weighted MRI’. 
Participants were also excluded if they were diagnosed with Parkinson’s 


disease, Lewy body dementia or frontotemporal dementia. History 
of asingle stroke or transient ischaemic attack was not an exclusion 
unless it was related to symptomatic onset of cognitive impairment. 
Participants also did not have current contraindications to MRI and 
were not currently using medications that might better account for 
any observed cognitive impairment. 


Clinical exam 

Participants underwent clinical assessments according to UDS proce- 
dures harmonized across all study sites, including clinical interview 
and review of any neurocognitive symptoms and health history with 
the participant and a knowledgeable informant. A general physical and 
neurologic exam was conducted. The CDR assessment was conducted 
in accordance with published standardization procedures, includ- 
ing standardized interview and assessment with the participant anda 
knowledgeable informant. In accordance with current diagnostic mod- 
els for cognitive and biological research criteria for cognitive impair- 
ment and AD”, participants were separately stratified by cognitive 
impairment and AD biomarker abnormality using established cutoffs 
for CSF AB,_..and pTau’*”>, Cognitive impairment was determined on 
the basis of global CDR score and neuropsychological impairment in 
one or more cognitive domains. 


Vascular risk factors 

The vascular risk factor (VRF) burden in each participant was evaluated 
through physical examination, blood tests, and clinical interviews 
with the participant and informant; history of cardiovascular disease 
(heart failure, angina, stent placement, coronary artery bypass graft, 
intermittent claudication), hypertension, hyperlipidaemia, type 2 
diabetes, atrial fibrillation, and transient ischaemic attack or minor 
stroke were investigated. The total VRF burden was defined by the sum 
of these risk factors, as previously described’. We assigned an elevated 
VRF burden to individuals with two or more VRFs. This threshold was 
adopted because previous studies showed that the presence of two 
or more VRFs is associated with occult cerebrovascular disease at 
autopsy in older adults with AD, whereas a single VRF is common and 
not necessarily associated with increased cerebrovascular disease in 
this population**®. 


Cognitive domain impairment evaluation 

Impairment in one or more cognitive domain was judged by perfor- 
mance oncomprehensive neuropsychological testing, using previously 
described neuropsychological criteria for cognitive impairment’. All 
participants underwent neuropsychological testing that included the 
UDS battery (version 2.0 or 3.0) plus supplementary neuropsychologi- 
cal tests at each site. Raw test scores were converted to age-, sex- and 
education-corrected zscores using the National Alzheimer’s Coordinat- 
ing Center (NACC) regression-based norming procedures (https://www. 
alz.washington.edu/). Normalized zscores from ten neuropsychologi- 
cal tests were evaluated in determining domain impairment, includ- 
ing three tests per cognitive domain (memory, attention/executive 
function and language) and one test of global cognition. Impairment 
in one or more cognitive domains was determined using previously 
described neuropsychological criteria, and was defined as a score >1 
s.d. below norm-referenced values on two or more tests withina single 
cognitive domain or three or more tests across cognitive domains”. 
Prior studies have established improved sensitivity and specificity of 
these criteria relative to those employing a single test score, as well 
as adaptability of this diagnostic approach to various neuropsycho- 
logical batteries***’”. Participants were excluded from cognitive domain 
analyses if they had less than 90% complete neuropsychological test 
data (53, 24, and 82 participants were excluded for MRI, PET, and CSF 
analyses, respectively). Included participants were classified as O, 1, or 
2+ based on the number of cognitive domains for which they had two 
or more impaired test scores. 
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Test battery specifics for each UDS version and recruitment site are 
as follows. i) Global cognition: MMSE for UDS version 2*8 and MoCA for 
UDS version 3”. ii) Memory: The Logical Memory Story Almmediate and 
Delayed free recall tests (modified from the original Wechsler Memory 
Scales, Third Edition (WMS-III)) for UDS version 2 and the Craft Stories 
Immediate and Delayed free recall for UDS version 3. For supplementary 
tests the USC participants underwent the California Verbal Learning 
Test, Second Edition (CVLT-II) and the Selective Reminding Test (SRT) 
sum of free recall trials. Norm-referenced scores for these supplemen- 
tary test scores were derived from a nationally representative sample 
published with the test manual (CVLT-II)* and in studies of normally 
ageing adults (SRT). iii) Attention and executive function: The Trails 
A, Trails B, and Wechsler Adult Intelligence Scale—Revised (WAIS-R) 
Digit Span Backwards tests for UDS version 2 and the Trails A, Trails B 
and Digit Span Backwards tests for UDS version 3. iv) Language: The 
Animal Fluency, Vegetable Fluency, and Boston Naming Tests for UDS 
version 2 and Animal Fluency, Vegetable Fluency, and Multilingual 
Naming Test (MINT) for UDS version 3. 


Magnetic resonance imaging and analysis 

The MRI data sets were obtained at the Mark and Mary Stevens Neuro- 
imaging and Informatics Institute of USC and Washington University of 
St. Louis. We developed a standardized high-resolution 3T MRI brain 
scan protocol. At the USC site, a Siemens 3T Prisma scanner was used 
with a product 32-channel head receive coil and body transmit coil. 
At the WashuU site, a Siemens 3T mMR with 20-channel head coil and 
Siemens 3T Vida with 64-channel head coil were used. Anatomical coro- 
nal spin echo T2-weighted scans were first obtained through the hip- 
pocampi (TR/TE 8020/50 ms, NEX =1, slice thickness 2mm with 2mm 
gap between slices, FOV =175 x 175 mm, matrix size = 448 x 448). Base- 
line coronal T1-weighted maps were then acquired using a T1-weighted 
3D volumetric interpolated breath-hold (VIBE) sequence and variable 
flip angle method using flip angles of 2°, 5°, 10°, 12°, and 15°. Coronal 
DCE-MRI covering the hippocampi and temporal lobes was acquired 
using a T1-weighted 3D VIBE sequence (FA = 15°, TR/TE=5.14/2.18 ms, 
NEX = 1, slice thickness 5 mm with no gap, FOV 175 x 175 mm, matrix 
size 320 x 320, voxel size 0.550 x 0.550 x 5 mm). This sequence was 
repeated for a total of 16 min with an approximate time resolution of 
15.4 s. Gadolinium-based contrast agent (GBCA), gadoterate meglu- 
mine (Dotarem, Guerbet, France) (0.05 mmol/kg), was administered 
intravenously into the antecubital vein using a power injector, ata 
rate of 3 ml/s followed by a 25-ml saline flush, 30 s into the DCE scan. 

The standardization and optimization of the MRI protocol required 
several tests performed ona phantom. Specifically, scanner charac- 
terization and calibration sequences including Bo, T1, and variable 
flip-angle mapping were implemented, optimized, and applied. After 
the achievement of good results in terms of quality control and repro- 
ducibility, we standardized and employed the same pre-contrast and 
dynamic T1-weighted protocols at both USC and Washington University 
sites. Of note, all the other MR sequences were also identical on both 
scanners. 

In order to minimize inter-site variability, the entire MRI protocol 
including the anatomical and DCE pulse sequences were 100% mir- 
rored from one site to another and the same contrast agent gadoter- 
ate meglumine (Dotarem) was injected into participants at the same 
concentration (0.05 mmol/kg). Finally, exactly the same pre- and 
post-processing analysis pipeline was applied for both sites, including 
T1 multi-FA mapping using linear fitting and Patlak-based DCE model- 
ling using the arterial input function determined in each individual 
from the internal carotid artery. Applying all the above cited factors 
greatly limited inter-site variability. 

The consistency of the results from the two sites was additionally 
confirmed by our previous publication’. In brief, we performed the 
analysis of the combined DCE data sets from both USC and WashU 
sites, and additionally site-specific analysis for each of the two sites 


separately, which showed nostatistically significant differences across 
sites. Recently, we invited a subset of 52 participants for an additional 
T1-weighted scan without contrast (using the same scanner and same 
MR pulse sequences) after their first DCE-MRI“! and measured both B, 
and T1 values at atwo-year interval. This study showed that the results 
were unchanged and consistent across the scans, supporting minimal 
intra-site variability. 


Quantification of BBB permeability 
See Supplementary Information, Supplementary Methods. 


Quantification of regional brain volumes 

HC and PHG morphometry were performed using the FreeSurfer 
(v5.3.0) software package” (http://surfer.nmr.mgh.harvard.edu/), as 
previously described’. The HC and PHG were segmented using Free- 
Surfer Desikan-Killiany and subcortical atlases“. Then, regional vol- 
umes (mm?) were derived accordingly. The technical details of this 
procedure have been described previously***. Data processing and 
visualization were performed using the Laboratory of Neuro Imaging 
(LONI) pipeline system (http://pipeline.loni.usc.edu) and Quantitative 
Imaging Toolkit*””. 


Positron emission tomography and analysis 

PET image acquisition was performed at the Molecular Imaging Center 
of USC or Mallinckrodt Institute of Radiology of WashU. Amyloid 
and tau PET studies were conducted using '8F-florbetaben (FBB) or 
18F-florbetapir (FBP) and ®F-flortaucipir (AV1451), respectively. FBB (Life 
Molecular Imaging, Inc.) was obtained from SOPHIE, Inc. for the USC 
site, while FBP was provided by Eli Lilly and Company for the WashU site. 
For all amyloid PET analysis the FBP and FBB data sets were combined. 
AV1451 was provided by Avid Radiopharmaceuticals, Inc. for the USC 
site and was produced by the Mallinckrodt Institute of Radiology for 
the WashU site. A Siemens Biograph 64 PET scanner was used at the 
USC site. At the WashU site, FBP scans were acquired on a Siemens 
mMRand AV1451 scans were acquired ona Siemens Biograph mCT. The 
mCT session was used for attenuation correction of the mMR scans. 
Participants were injected with 300 MBq (+10%) of FBB or 370 MBq 
(+10%) of FBP. FBB and FBP images were acquired from 90 to 110 min 
and 50 to 70 min, respectively, after injection in accordance with the 
manufacturers’ recommendations. Individuals who participated in 
amyloid and tau PET studies also had their DCE-MRI scan within 2.2+ 0.9 
and 2.1+ 0.6 months of their amyloid and tau PET scans, respectively. 

In brief, acomputed tomography (CT) scan was performed first for 
attenuation correction before each PET imaging session. The down- 
loaded PET images from FBB, FBP, and AV1451 tracers were processed 
by using standard uptake value maps (SUV in g/ml). All PET images 
were co-registered to structural high-resolution 3D T1-weighted Mag- 
netization Prepared Rapid Acquisition Gradient Echo (MP-RAGE) MRI 
images using FSL-FLIRT (FMRIB’s Linear Image Registration Tool)”. 
The FreeSurfer-segmented cerebellum was used as a reference tissue 
to normalize for both amyloid and tau*. 

After co-registration of PET images into an anatomical reference 
image (MNI152 standard-space), Statistical Parametric Mapping 
(SPM12) was used for group comparison ona voxel-by-voxel basis. Age 
at time of PET imaging session, sex, and education were introduced in 
amultiple regression model as covariates. Level of significance was set 
to P< 0.001 for amyloid and P< 0.005 for tau (uncorrected P values) 
with the minimum number of voxels (Ke) ina cluster of 50. 

Additionally, given the known AV1451 off-target ligand binding inthe 
choroid plexus (CP)***?, which can contribute to HC regional AV1451 
signal owing to the close proximity of the CP to the HC and relatively 
low spatial resolution of PET scans (that is, ~6-mm voxel size), we took 
advantage of visualizing CP by DCE-MRI, also performed in these indi- 
viduals, which allowed us to subtract the contribution of the CP signal 
tothe HC AV1451 proper signal. The following steps were used to correct 


for off-target ligand binding to the CP (see Extended Data Fig. 5). Step 1: 
HC masks were generated from the 3D T1-weighted MP-RAGE. Step 2: CP 
masks were generated from the T1-weighted VIBE post-GBCA (FA=15°) 
image. Step 3: HC and CP masks were overlaid. Step 4: The overlap of 
the CP and HC masks was subtracted to obtain a CP-corrected HC PET 
signal after adding 6-mm voxel size on top of the CP mask generated 
from DCE data. Representative images of HC AV1451 PET signal before 
and after applying the CP correction are shown in Extended Data Fig. 5. 

We next quantified regional changes in amyloid and tau SUV ratio 
(SUVR) in relation to regional DCE-MRI K,,,,; Values in all participants 
stratified by APOE genotype. The regional SUVR values were taken from 
the FreeSurfer-segmented HC, PHG, OFC’, and ITG”’. The BBB K¢rans 
constant (DCE-MRI) was determined in all participants (Extended Data 
Tables 2a, b). This includes those who were analysed for both amyloid 
and tau (n= 58), only amyloid (n = 9) or only tau (n= 29). 


Lumbar puncture and venipuncture 

Participants underwent a lumbar puncture and venipuncture in the 
morning after an overnight fast. The CSF was collected in polypropylene 
tubes, processed (centrifuged at 2,000g, 4 °C, 10 min USC site; 5 min 
Washu site), aliquoted into polypropylene tubes and stored at —80 °C 
until assay. Blood was collected into EDTA (EDTA) tubes and processed 
(centrifuged at 2,000g, 4 °C, 10 min USC site; 5 min WashU site). Plasma 
and buffy coat were aliquoted in polypropylene tubes and stored at 
-80 °C; buffy coat was used for DNA extraction and APOE genotyping. 


APOE genotyping 

DNA was extracted from buffy coat using the Quick-gDNA Blood 
Miniprep Kit (catalogue no. D3024, Zymo Research, Irvine, CA). APOE 
genotyping was performed via polymerase chain reaction (PCR)-based 
retention fragment length polymorphism analysis, as previously 
reported’. 


Molecular assays 

Quantitative western blotting of sSPDGFRB. The quantitative western 
blot analysis was used to detect SPDGFRB in human CSF (ng/ml), as 
previously reported’®. 


BBB breakdown biomarkers. Albumin quotient (Q,,,, the ratio of CSF 
to plasma albumin levels) and CSF levels of fibrinogen and plasminogen 
were determined using enzyme-linked immunosorbent assay (ELISA), 
as previously reported’®. 


Cyclophilin A. We developed a novel CypA assay on the Meso Scale 
Discovery (MSD) platform. Standard-bind 96-well plates (catalogue 
no. LISXA-3/L11XA-3, MSD, Rockville, MD) were spot-coated with 5 ul 
per well of 40 pg/ml rabbit polyclonal anti-CypA antibody (catalogue 
no. 10436-T52, Sino Biological, Wayne, PA) prepared in 0.03% Triton 
X-100 in 0.01M PBS pH 7.4 solution. The plates were left undisturbed 
overnight to dry at room temperature. The next day, the plates were 
blocked with 150 ul per well of Blocking One (catalogue no. 03953-95, 
Nacalai Tesque, Japan) and incubated for exactly 1 h with shaking. 
Meanwhile, samples and standards were prepared in Blocking One 
blocking buffer. Different concentrations ranging from 3.5 to 200 ng/ml 
of arecombinant human CypA protein (catalogue no. 3589-CAB, R&D 
Systems, Minneapolis, MN) were used to generate a standard curve. 
All CSF samples were diluted 1:3. After blocking, the plates were manu- 
ally washed three times with 200 pl per well of wash buffer (in 0.05% 
Tween-20 in 0.01 M PBS pH 7.4). The prepared samples or standards 
were added at 25 ul per well, and the plates were incubated overnight 
at 4 °C with shaking. 

The next day, the plates were washed three times, and 25 pl per well 
of 1 pg/ml sulfo-tagged mouse monoclonal CypA detection antibody 
(catalogue no. ab58144, Abcam, Cambridge, MA), prepared in Block- 
ing One. The plates were incubated for 90 min at room temperature 


with shaking. Next, the plates were washed four times, then 150 pl per 
well of 2x Read Buffer T with surfactant (catalogue no. R92TC-3, MSD, 
Rockville, MD) was added and the plates were read immediately onan 
MSD SECTOR Imager 6000 (MSD, Rockville, MD) with electrochemi- 
luminescence detection. 

The raw readings were analysed by subtracting the average back- 
ground value of the zero standard from each recombinant standard 
and sample reading. A standard curve was constructed by plotting 
the recombinant standard readings and their known concentrations 
and applying a nonlinear four-parameter logistics curve fit. The CypA 
concentrations were calculated using the samples’ reading and the 
standard curve equation; the result was corrected for the sample dilu- 
tion factor to arrive at the CypA concentration in the CSF samples. 


Matrix metalloproteinase-9. CSF levels of MMP9 were determined us- 
ing the human MMP9 Ultra-Sensitive Kit from MSD (cat. No. K1S1HAC). 


Neuron-specific enolase. CSF levels of NSE were determined using 
ELISA (cat. no. E-SONEN, Immunology Consultant Laboratories, Port- 
land, OR). The company no longer sells this product; thus, this analyte 
was measured in the majority of participants but notin those individuals 
that enrolled in the study most recently. 


S100B. CSF levels of the astrocyte-derived cytokine, S100 
calcium-binding protein B (S100B), were determined using ELISA (cat. 
no. EZHS100B-33K, EMD Millipore, Billerica, MA). 


Inflammatory markers. An MSD multiplex assay was used to deter- 
mine CSF levels of intercellular adhesion molecule 1 (ICAM1) (cat. no. 
K15198D, MSD, Rockville, MD), and interleukin-6 (IL6), IL-1B, tumour ne- 
crosis factor-a (TNFa), and interferon gamma (IFNy) (cat. no. K15049G, 
MSD, Rockville, MD). 


AB peptides. An MSD multiplex assay (cat. no. K15200E, MSD, Rock- 
ville, MD) was used to determine CSF levels of AB,_,5. Participants were 
stratified based on CSF analysis as either AB+ (<190 pg/ml) or AB- (>190 
pg/ml) using the accepted cutoff values as previously reported for the 
MSD 6E10 Af peptide assay”. 


Tau. Phosphorylated tau (pT181) was determined by ELISA (cat. no. 
81581, Innotest, Fujirebio US, Inc., Malvern, PA). Participants were strati- 
fied based on CSF analysis as either pTau+ (>78 pg/ml) or pTau- (<78 
pg/ml), using the accepted cutoff value as previously reported”. 


Human iPSCs 

iPSC lines were generated by reprogramming of skin fibroblasts 
from APOE ¢4/¢4 and APOE ¢3/¢3 donors with AD or without (con- 
trol) as recently reported**. Reprogramming was performed using 
integration-free Sendai virus vectors and we passaged cells to pas- 
sage 15 and confirmed normal karyotype. hiPSCs were maintained on 
Matrigel (Corning) in mTeSR1 (catalogue no. 85850, StemCell Tech- 
nologies, Vancouver, BC, Canada) supplemented with 10 ng/ml FGF2 
StemBeads (StemCultures) or mTeSR plus (StemCell Technologies) 
every other day. 


Differentiation of iPSCs into pericytes 

Differentiation of iPSCs into pericytes was carried out as described previ- 
ously. In brief, iPSCs were dissociated with ReLeSR (catalogue no. 05872, 
StemCell Technologies) and seeded at 55,000 cells/cm? in Essential 8 
medium (catalogue no. A1517001, ThermoFisher, Waltham, MA, USA) 
supplemented with ROCK inhibitor Y-27632 (10 iM, catalogue no. 72304, 
StemCell Technologies) on Matrigel (0.5 mg/6-well plate, catalogue no. 
354230, Corning, NY, USA). After 24 hincubation, the iPSCs were switched 
into STEMdiff Mesoderm Induction Medium (MIM, catalogue no. 05221, 
StemCell Technologies) for 5 days with daily medium change. On day 


Article 


6 of MIM treatment, the cells were plated on Matrigel at 25,000 cells/ 
cm’ in pericyte medium (catalogue no. 1201, ScienCell, Carlsbad, CA, 
USA) for an additional 7 days. The differentiated cells were dissociated 
with Accutase (catalogue no. 07920, StemCell Technologies). Following 
incubation with human PDGFR® biotinylated antibody (catalogue no. 
BAF385, R&D Systems, Minneapolis, MN, USA), the cells were incubated 
with anti-biotin microbeads (catalogue no. 130-090-485, Miltenyi Biotec, 
Bergisch Gladbach, NRW, Germany) and magnetically sorted using MACS 
LS columns (catalogue no. 130-042-401, Miltenyi Biotec) following the 
manufacturer’s instructions. Sorted pericytes were plated at a density of 
25,000 cells/cm’ on Matrigel-coated coverslips for immunocytochem- 
istry analyses or poly-L-lysine-coated six-well culture plates for western 
blot analyses. Differentiated pericytes were positive for the pericyte 
markers PDGFRB, CD13, and NG2, and negative for the endothelial marker 
CD31, the astrocytic marker GFAP, and the microglial marker CD11b. 


Statistical analyses 

Prior to performing statistical analyses, we first screened for outliers 
using the Grubbs test, also called the ESD (extreme studentized devi- 
ate) method, applying a significance level of a = 0.01 (https://www. 
graphpad.com/quickcalcs/grubbs1/). For each of the outliers identified, 
a secondary index of outlier influence was applied using the degree of 
deviation from the mean (greater than +3s.d.)*°. Using these stringent 
criteria, a total of five outliers (one each in Figs. 1j, kand 2j and oneeach 
in Extended Data Fig. 6a, b, were removed from analyses, as indicated in 
the legends of these figures. Continuous variables were also evaluated 
for departures from normality through quantitative examination of 
skewness and kurtosis, in addition to visual inspection of frequency 
distributions. Where departures of normality were identified, log,, trans- 
formations were applied, and distribution normalization was confirmed 
before parametric analyses. This was done for Fig. 4h, k and Extended 
Data Fig. 7a, b, d, e. As the use of log,, transformations accounts for any 
non-normality, this obviated the need for outliers exclusion. 


DCE-MRI K,,n< and CSF sPDGFRB and CypA. Regional DCE-MRI K rans 
values and CSF sPDGFRB, CypA and MMP9 levels were compared across 
the entire sample stratified by APOE status. As in the APOF4 group 
relatively few participants were homozygous €4/¢4 compared to het- 
erozygous €3/e4 (14% for DCE-MRI analysis, and 18% for sSPDGFRB analy- 
sis), and initial comparisons between €4/e4 and €3/e4 carriers did not 
showany significant differences in regional HC and PHG DCE-MRI K;rans 
values (CDR O, Pic = 0.19 and Ppyg= 0.54 (PHG); CDR 0.5, P,-= 0.22. and 
Poy = 0.84) or CSF SPDGFR® levels (CDR O, P=0.23; CDR 0.5, P=0.47), 
all subsequent analyses combined APOEF4 carriers (€3/e4 and €4/€4), 
and compared these participants to APOE3 carriers (€3/€3) stratified 
by cognitive impairment status (CDR O versus 0.5 and O versus 1 versus 
2+ cognitive domain impairment using ANCOVA with FDR correction 
for multiple comparisons (see details below). For CDR analyses, model 
covariates included age, sex, and education. Cognitive domain impair- 
ment was determined using age-, sex-, and education-corrected values, 
so these covariates were not additionally included in the analyses. 
Additional post hoc ANCOVA analyses evaluated whether the observed 
differences remained significant after stratifying APOE4 carriers by CSF 
AB,_,. and pTau status, and after statistically controlling for CSF AB,_4> 
and pTau status and regional brain volume in APOF4 non-carriers and 
carriers. These findings were also confirmed by hierarchical logistic 
regression models using the same covariates. 


PET AD biomarkers. Ina subset of participants who underwent amyloid 
and tau PET imaging together with DCE-MRI studies, we used ANCOVA 
models controlled for age, sex and education to compare regional 
amyloid and tau ligand binding and DCE-MRI values in a set of APOE4 
non-carriers and carriers within a priori regions of interest, based on 
prior imaging studies, to determine whether distinct regional patholo- 
gies differed by APOF4 carrier status. 


Baseline CSF sPDGFR§ as a continuous predictor of cognitive 
decline. For linear mixed model analysis, baseline CSF sSPDGFRB was 
a continuous predictor of demographically corrected global cogni- 
tive change at 2-year follow up intervals, controlling for CSF AB,_4>5 
and CSF pTau status. Global cognition was indexed by age-, sex-, 
and education-corrected z scores on mental status exam (MMSE or 
MoCA) and as the global cognitive composite of all age-, sex-, and 
education-corrected neuropsychological test z scores (see above for 
list of neuropsychological tests). Time was modelled with date of LP 
as baseline (tO) with two follow-up intervals of 2 years each (tl, t2). 
Additional analyses confirmed all findings when time was modelled 
as time since baseline, with date of lumbar puncture as baseline (tO) 
and follow up as annual intervals (t1—n). 

Alllongitudinal mixed models treated CSF sSPDGFRf as a continuous 
predictor. Although we have previously established that CSF sPDGFRB 
isamarker of pericyte injury’®”, the optimal cutoff value for abnormal 
CSF sPDGFRf levels indicative of pericyte injury remains unknown. 
Autopsy studies are required to determine optimal in vivo biomarker 
cutoff values that predict gold-standard neuropathological measures, 
such as studies conducted for CSF and PET markers of amyloid and 
tau. Given the lack of available autopsy data relating CSF sPDGFRB to 
neuropathological markers of pericyte injury, we chose to divide par- 
ticipants by CSF sPDGFR® values using median split for the purposes 
of visual display only (higher CSF sSPDGFRB was above sample median 
and lower CSF sPDGFR® was below sample median). The median split 
was not used in statistical analyses and was only used for the purpose 
of visual display (Fig. 3a) for statistical parameters from analyses using 
CSF sPDGFR® as a continuous predictor of cognitive decline). 


Correlational analyses. Pearson product moment correlations were 
used to evaluate relationships among CSF sPDGFRB, CypA, MMP39, 
fibrinogen, plasminogen and hippocampal and parahippocampal BBB 
Kans levels among APOF4 carriers. 


Multiple comparison correction and missing data. Given the large 
number of analyses, FDR correction was applied to Pvalues for primary 
study outcomes (DCE-MRI, sPDGFR) evaluated in the entire sample 
by APOF4 carrier status and CDR status using the Benjamini-Hoch- 
berg method*’ in ANCOVA and logistic regression models controlling 
for age, sex, education, brain volume, and CSF AB,_,, and pTau status 
(for DCE-MRI analyses). Post hoc confirmatory analyses in participant 
subsets further evaluating independence of CSF and PET markers of 
amyloid and tau, evaluation of mechanistic markers (that is, CypA and 
MMP9), and longitudinal analysis of predictive value of CSF sPDGFRB 
were not corrected for multiple comparisons. For longitudinal data 
with variable follow up, we used linear mixed model analyses with and 
accounted for missing data via the missing at random assumption. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 

All data generated and/or analysed during this study are either included 
in this article (and its Supplementary Information) or are available 
from the corresponding author on reasonable request. Source Data 
for Figs. 1-4 are provided with the article. 


Code availability 


All software used in this study are publicly available: Rocketship v1.2 
(https://github.com/petmri/ROCKETSHIP/blob/master/dce/com- 
pare_gui.m), FreeSurfer (v5.3.0) (http://surfer.nmr.mgh.harvard.edu/), 


FSL-FLIRT (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/FLIRT), SPM12 (https:// 
www.fil.ion.ucl.ac.uk/spm/software/spm12/), and Quantitative Imaging 
Toolkit (https://cabeen.io/about/publication/cabeen2018quantitative/). 
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and internal capsule (IC, h) in individuals with CDR O bearing APOE3 (black, 
n=128) and APOF4 (red, n= 68), and with CDR 0.5 bearing APOE3 (black, n=14) 
and APOE4 (red, n=25). Continuous lines, median; dotted lines, IQR. 
Significance by ANCOVAs for main effects and post hoc comparisons 
controlling for age, sex, and education. 


Extended Data Fig. 1| Regional BBB X,,,,, constant in eight additional brain 
regionsin APOEF4 carriers and non-carriers (APOE3) with CDR status 0 and 
0.5. BBB K,,..,; constant in the ITG (a), superior frontal gyrus (SFG, b), caudate 
nucleus (CN, c), thalamus (Thal, d), striatum (Str, e), subcortical watershed 
normal-appearing white matter (Subcort. WS NAWM, f), corpus callosum (CC, g), 
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Extended Data Fig. 2| BBB breakdown in the HC and PHG in APOF4 carriers and PHG volumes) in the HC (c) and PHG (d) in individuals with no cognitive 
increases with cognitive domain impairment.a,b, K,,,,, constant in the HC domains impaired bearing APOE3 (n=70) or APOF4 (n=40); one cognitive 
(a) and PHG (b) in individuals with no cognitive domains impaired bearing domain impaired bearing APOE3 (n=18) or APOF4 (n= 21); and two or more 
APOE3 (black, n=70) or APOF4 (red, n= 40); one cognitive domain impaired cognitive domains impaired bearing APOE3 (n=7) or APOF4 (n=12). 
bearing APOE3 (n=18) or APOE4 (n= 21); and two or more cognitive domains Significance by ANCOVA for main effects and post hoc comparisons 
impaired bearing APOE3 (n=7) or APOF4 (n=12). Continuous lines, median; controlling for age, sex, and education. All ANCOVA omnibus tests remained 
dotted lines, IQR.c, d, K,,an; (estimated marginal mean +s.e.m.from ANCOVA significant at FDR threshold of 0.05. 
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Extended Data Fig. 3| Regional BBB K,,,,, constant in eight additional brain 
regions inAPOE4 carriers and APOE3 carriers with different degrees of 
cognitive domain impairment. K,,,,,, constant in the ITG (a), SFG (b), CN (c), 
thalamus (d), striatum (e), subcortical WS NAWM (f), CC (g), and IC (h) in 
individuals with no cognitive domains impaired bearing APOE3 (black, n=70) 
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or APOF4 (red, n=40): one cognitive domain impaired bearing APOE3 (n=18) or 
APOE4 (n= 21); and two or more cognitive domains impaired bearing APOE3 
(n=7) or APOF4 (n=12). Continuous lines, median; dotted lines, IQR. 
Significance tests from ANCOVAs for main effects and post hoc comparisons 
controlling for age, sex, and education. 
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Extended Data Fig. 4 | Regional BBB K,,.,, constant in all studied brain 
regions in APOE4 carriers and APOE3 carriers in relation to vascular risk 
factors. K,,,,,; constant in the HC (a), PHG (b), ITG (c), SFG (d), CN (e), thalamus 
(f), striatum (g), subcortical WS NAWM (h), CC (i), and IC (j) in APOE3 (green, 


Subcort. WS NAWM 


(x 10° min) 


‘trans 


BBB K, 


p=0.11 


0) 


n=80) and APOF4 (brown, n=42) carriers with O-1 vascular risk factors (VRFs), 
and APOE3 (n=58) and APOE4 (n=51) carriers with 2+ VRFs. Continuous lines, 
medians; dotted lines, IQR. Significance by ANCOVAs for main effects and post 
hoc comparisons controlling for age, sex, and education. 
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Extended Data Fig. 5 | Amyloid and tau PET analysis in APOF4 carriers and 
correction of “F-AV1451 off-target binding in the choroid plexus. All studies 
were performed in individuals with CDR score 0. Amyloid and tau PET studies 
were conducted using '*F- FBB or 8F- FBP, and 8F- AV1451, respectively. For 
amyloid PET data analysis, FBP and FBB data sets were combined. a, Uptake of 
amyloid tracers by the OFC in APOF4 (n= 29) relative to APOF3 (n=45) carriers 
(voxel-wise two-sample one-tailed t-tests). b, Representative amyloid PET SUVR 
maps from an APOE3 homozygote (top) and an APOF4 carrier (APOE4) 
(bottom). Slices 1and 2, regions of interest (ROIs) for amyloid PET and BBB 
DCE-MRI scans (see e). Arrow, amyloid tracer uptake by OFC. The APOF3 and 
APOF4 representative images used FBP. c, Uptake of tau tracer shows 
undetectable tau accumulation in APOE3 (n= 60) or APOF4 (n=37) carriers 
(voxel-wise two-sample one-tailed t-tests). d, Representative tau PET SUVR 


maps from APOE3 (top) and APOF4 (bottom) carriers. Slices 1and 1’, ROIs for tau 
PET and BBB DCE-MRIscans, respectively (see e).e, Coronal 3D scans of regions 
studied in Fig. 2: HC (red), PHG (green), medial OFC (yellow), and ITG (blue). 

f, Correction of 8F-AV1451 off-target binding in the choroid plexus. Step 1, HC 
masks were generated from the 3D T1-weighted magnetization prepared-rapid 
gradient echo (MP-RAGE). Step 2, CP masks were generated from the 
T1-weighted VIBE post-GBCA image (flip angle, 15°). Step 3, HC and CP masks 
were overlaid (arrowheads, red). Step 4, areas of CP overlap with HC masks 
(arrowheads, yellow) were subtracted to obtain CP-corrected HC tau PET signal 
after adding 6-mm voxel size on top of CP mask generated from DCE data. 

g, Representative images of HC tau PET signal before (top) and after (bottom) 
applying the CP correction (arrows and white dotted lines show overlap 
between HC and CP). 
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Extended Data Fig. 6 | CSF biomarkers of glia and inflammatory response 
and endothelial and neuronal cell injury in APOE4 and APOE3carriers. 

a, CSF astrocytic S1OOB levels in individuals with CDR O bearing APOE3 (black, 
n=77) or APOF4 (red, n= 41), and with CDR 0.5 bearing APOE3 (n= 39) or APOE4 
(n=32).b, CSF IL6 levels in individuals with CDR O bearing APOE3 (n=71) or 
APOE4 (n= 47), and with CDR 0.5 bearing APOE3 (n= 34) or APOF4 (n=32).c, CSF 
IFNy levels in individuals with CDR O bearing APOE3 (n= 54) or APOF4 (n=29), 
and with CDR 0.5 bearing APOEF3 (n= 25) or APOF4 (n=17).d, CSF ILI levels in 
individuals with CDR O bearing APOE3 (n= 43) or APOF4 (n=18), and with CDR 
0.5 bearing APOE3 (n=17) or APOF4 (n=13). (e) CSF TNFa levels in individuals 
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with CDR O bearing APOE3 (n= 70) or APOE4 (n= 46), and with CDR 0.5 bearing 
APOE3 (n=34) or APOF4 (n= 32). f, CSF soluble intercellular adhesion molecule1 
(sSICAMI) levels in individuals with CDR O bearing APOE3 (n=77) or APOE4 
(n=40), and with CDR 0.5 bearing APOE3 (n= 39) or APOF4 (n=33).g, CSF NSE 
levels in individuals with CDR 0 bearing APOE3 (n= 47) or APOE4 (n= 32), and 
with CDR 0.5 bearing APOE3 (n = 29) or APOE4 (n=29). Continuous lines, 
median; dotted lines, IQR. aandb had one outlier each, which were removed 
before statistical analysis (see Methods). Significance by ANCOVAs for main 
effects and post hoc comparisons controlling for age, sex, and education. 
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Extended Data Fig. 7 | Decreased CSF AB,_,. and increased pTau levelsin 
APOE4 carriers with cognitive impairment. a, CSF AB,_,. levels in individuals 
with CDR O bearing APOE3 (black, n= 141) or APOF4 (red, n= 83) and with CDR 
0.5 bearing APOE3 (n=39) or APOEF4 (n= 41). b, CSF AB,-42 levels in APOE3 (n= 89) 
and APOF4 (n=55) carriers with no cognitive domains impaired, APOE3 (n=29) 
and APOF4 (n= 31) carriers with one cognitive domain impaired, and APOE3 
(n=17) and APOF4 (n=14) carriers with two or more cognitive domains 
impaired.c, CSF AB,.,, levels (estimated marginal means +s.e.m. from ANCOVA 
models corrected for age, sex, education, and CSF sPDGFRf levels) in 
individuals with CDR O bearing APOE3 (n= 141) or APOF4 (n=83) and withCDR 
0.5 bearing APOE3 (n=39) or APOE4 (n= 41). d, CSF pTau levels in individuals 
with CDR O bearing APOE3 (n= 141) or APOF4 (n= 82) and with CDR 0.5 bearing 


APOE3 (n= 39) or APOEF4 (n=43).e, CSF pTau levels in APOE3 (n= 89) and APOE4 
(n=56) carriers with no cognitive domains impaired, APOF3 (n=29) and APOE4 
(n=30) carriers with one cognitive domain impaired, and APOE3 (n=17) and 
APOE4 (n=15) carriers with two or more cognitive domains impaired. f, CSF 
pTau levels (estimated marginal means +s.e.m. from ANCOVA models 
corrected for age, sex, education, and CSF sPDGFRf levels) in individuals with 
CDR 0 bearing APOE3 (n=141) or APOF4 (red, n= 82) and with CDR 0.5 bearing 
APOE3 (n= 39) or APOF4 (n= 43). Violin plots: continuous lines, median; dotted 
lines, IQR. CSF AB,_,. and pTau values were log, )-transformed before statistical 
analysis because they hada non-normal distribution. Significance tests from 
ANCOVAs for main effects and post hoc comparisons controlling for age, sex, 
and education. 
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Extended Data Fig. 8 | Full scans of western blots. Full scans of western blots for CypA shown in Fig. 4m (top). 
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Extended Data Table 1| APOE3 and APOE4 carriers studied for regional BBB permeability changes by DCE-MRI 


APOE Genotype APOE3 APOE4 APOE3 APOE4 
Clinical Dementia Rating (CDR) 0 0 05 0.5 
No. of participants 130 76 14 25 
Age at MRI, years, Mean (SD) 69.9 (7.9) 67.3 (8.7) 73.8 (8.3) 69.4 (8.7) 
Female, % 62.3 57.9 42.9 56 
Education, years, Mean (SD) 16.6 (2.7) 16.7 (2.0) 16.4 (2.5) 17.1 (2.1) 
Cognitive domain impairment, No. 0, 1, 2+ 78, 17, 2 38, 16, 4 6, 2,6 6, 8,9 
Vascular risk factors, No. 0-1, 2+ 77, 53 38, 38 8,6 10, 15 


Extended Data Table 2 | APOE3 and APOE4 carriers studied for regional amyloid or tau brain accumulation by PET and BBB 


permeability changes by DCE-MRI 


APOE Genotype APOE3 APOE4 
Clinical Dementia Rating (CDR) 0 0 
No. of participants 45 29 
No. of participants (FBB, FBP) 5, 40 9, 20 
Age at amyloid PET, years, Mean (SD) 68.4 (7.5) 65.7 (8.8) 
Female, % Too 60 
Education, years, Mean (SD) 16.7 (2.7) 16.5 (2.1) 
Cognitive domain impairment, No. 0, 1, 2+ 24, 4,0 17,4, 1 
Vascular risk factors, No. 0-1, 2+ 23, 22 19, 10 


APOE Genotype 


Clinical Dementia Rating (CDR) 


No. of participants 

Age at tau PET, years, Mean (SD) 

Female, % 

Education, years, Mean (SD) 

Cognitive domain impairment, No. 0, 1, 2+ 


Vascular risk factors, No. 0-1, 2+ 


60 
68.7 (7.9) 
66.6 
16.5 (2.7) 
27, 5,1 
36, 24 


37 
64.0 (8.4) 
37.8 
16.6 (2.1) 
15, 6, 1 
23, 14 


a, Participants studied for amyloid brain accumulation; b, participants studied for tau accumulation. FBB, participants who received '*F-Florbetaben; FBP, participants who received 


"®F-Florbetapir. 
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Extended Data Table 3 | APOE3 and APOE4 carriers studied for CSF sPDGFR levels 


APOE Genotype APOE3 APOE4 APOE3 APOE4 
Clinical Dementia Rating (CDR) 0 0 0.5 0.5 
No. of participants 157 105 40 48 
Age at LP, years, Mean (SD) 70.2 (8.9) 67.3 (9.9) 76.5 (7.3) 72.8 (8.1) 
Female, % 64 61 39 37 
Education, years, Mean (SD) 16.5 (2.7) 16.3 (2.3) 15.7 (2.8) 16.2 (2.8) 
Cognitive domain impairment, No. 0, 1, 2+ 91, 20,2 62, 17,2 9, 8, 14 12, 16, 15 
Vascular risk factors, No. 0-1, 2+ 100, 57 57, 48 22, 18 18, 30 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


4) A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 
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For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 
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Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection The data collection software used were SYNGO (console software) and VE11C (MRI software) for MRI, and PETSyngo 6.7.3 for PET. 


Data analysis Our Rocketship GUI code v1.2 (https://github.com/petmri/ROCKETSHIP/blob/master/dce/compare_gui.m) running with Matlab R2019a 
was used for DCE-MRI analyses. We also used FreeSurfer (v5.3.0) software package (http://surfer.nmr.mgh.harvard.edu/) for regional 
brain volume analyses. We used FSL-FLIRT (FMRIB's Linear Image Registration Tool) for PET-MR co-registration (https://fsl.fmrib.ox.ac.uk/ 
fsl/fslwiki/FLIRT), and Statistical Parametric Mapping (SPM12) was used for PET group comparison in a voxel-by-voxel basis (https:// 
www-.fil.ion.ucl.ac.uk/spm/software/spm12/). Quantitative Imaging Toolkit was used for data visualization (https://cabeen.io/about/ 
publication/cabeen2018quantitative/). Statistical analyses were conducted with a commercial statistical software package - SPSS (IBM) 
v26. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


A full data availability statement is included in the manuscript. The data that support the findings of this study are available from the corresponding author upon 
reasonable request. See Data Availability statement pasted here: "All data generated and/or analyzed during this study are either included in this article (and its 
Supplementary Information) or are available from the corresponding author on reasonable request. Source Data for Figs. 1-4 are provided with the article". 
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Sample size 


Data exclusions 


Replication 


Based on prior studies demonstrating very large effect sizes for blood-brain barrier markers relating to cognitive impairment (see Montagne 
et al., Neuron 2015 and Nation et al., Nat Med 2019), we anticipated needing a minimum sample size of 10-20 participants per subgroup 
(APOE4 status x CDR status) indicating a minimum total sample size of 40-80 to be sufficient to detect significant blood-brain barrier marker 
effects on cognitive status in all participant subgroups. Consistent with these prior studies, very large effect sizes were observed for all 
primary study outcomes (up to n2 > .32). Based on these observations, a post-hoc power analysis for uncorrected comparisons indicated 
>99% power to detect significant differences, noncentrality parameter, delta = 5.99, critical t = 1.99, for an a = .05). In another post-hoc 
power analysis for corrected comparisons (ANCOVA with 5 covariates), we again found >99% power to detect significant differences 
(noncentrality parameter, A = 45.57, critical F = 1.9, for an a = .05). Post-hoc power analyses were conducted using a general power analysis 
software program (i.e., G*Power). 


Prior to performing statistical analyses, we first screened for outliers using the Grubbs’ test, also called the ESD method (Extreme Studentized 
Deviate), applying a significance level of a=0.01 (https://www.graphpad.com/quickcalcs/grubbs1/). For each of the outliers identified, a 
secondary index of outlier influence was applied using the degree of deviation from the mean (greater than +/- 3 SDs). Using these stringent 
criteria, a total of 5 outliers one each in main Figure panels 1j, 1k, and 2j, and one each in Extended Data Figs. 6a and 6b, were removed from 
analyses, as indicated in the legends of these figures. Continuous variables were also evaluated for departures from normality through 
quantitative examination of skewness and kurtosis, in addition to visual inspection of frequency distributions. Where departures of normality 
were identified, log10-transformations were applied, and distribution normalization was confirmed prior to parametric analyses. This has 
been done for main Fig.s 4h and 4k, and Extended Data Figs. 7a, 7b, 7d, and 7e. Since use of log10-transformations accounts for any non- 
normality this obviated the need for outliers exclusion. 

Further, as stated in our "Participant Inclusion and Exclusion critera" (see Methods), we included participants (245 years of age) with 
neuropsychologically-confirmed no cognitive dysfunction or mild cognitive dysfunction had no current or prior history of any neurological or 
psychiatric conditions that might better account for any observed cognitive impairment, including organ failure, brain tumors, epilepsy, 
hydrocephalus, schizophrenia, major depression. Participants were stratified by APOE genotype as APOE4 carriers (€3/e4 and €4/e4) and 
APOE4 non-carriers (€3/€3) also defined as APOE3 homozygotes who were cognitively normal or with mild cognitive dysfunction, as 
determined by clinical dementia rating (CDR) scores and the presence of cognitive impairment in one or more cognitive domains based on 
comprehensive neuropsychological evaluation including performance on ten neuropsychological tests assessing memory, attention/executive 
function, language and global cognition. For all analyses individuals with €3/e4 and €4/e4 alleles were pooled together in a single APOE4 
group, as we did not find in the present cohort consisting of 82-86% €3/e4 and 14-18% €4/e4 participants (depending on the outcome 
measure) a significant difference between the two versus one €4 allele for the studied parameters including the BBB permeability 
unidirectional transfer constant Ktrans values and sPDGFRB CSF values (see statistical section below). Individuals have been additionally 
stratified by AB and pTau CSF analysis as either AB-positive (AB1-42+, <190 pg/mL) or AB-negative (AB1-42-, >190 pg/mL), or pTau-positive 
(pTau+, >78 pg/mL) or pTau-negative (pTau-, <78 pg/mL), using accepted cutoff values. 

Participants were excluded if they were diagnosed with vascular cognitive impairment or vascular dementia. Clinical diagnoses were made by 
neurologists and criteria included whether the patient 1) had a known vascular brain injury and 2) the clinician judged that the vascular brain 
injury played a role in their cognitive impairment, and/or pattern and course of symptoms. In addition to clinical diagnosis, presence of 
vascular lesions was confirmed by moderate-to-severe white matter changes and lacunar infarcts by fluid-attenuated inversion recovery 
(FLAIR) MRI and/or subcortical microbleeds by T2*-weighted MRI. Participants were also excluded if they were diagnosed with Parkinson’s 
disease, Lewy body dementia or frontotemporal dementia. History of a single stroke or transient ischemic attack was not an exclusion unless 
it was related to symptomatic onset of cognitive impairment. Participants also did not have current contraindications to MRI and were not 
currently using medications that might better account for any observed cognitive impairment. 


Here, we recruited the participants from two sites, University of Southern California (USC), Los Angeles, CA, and Washington University 
(WashU), St. Louis, MO. At the USC site, participants were recruited through the USC Alzheimer’s Disease Research Center (ADRC): 
combined USC and the Huntington Medical Research Institutes (HMRI), Pasadena, CA. Our previous study findings regarding CSF biomarker 
analyses were replicated in two independent samples (Nation et al., Nat Med 2019). 

The DCE-MRI findings were also conducted at two sites (USC and WashU) using high-resolution 3T MR imaging brain sc an protocol. For USC 
site, a Siemens 3T Prisma scanner was used with a product 32-channel head receive coil and body transmit coil. For WashU site, a Siemens 3T 
mMR with 20-channel head coil and Siemens 3T Vida with 64-channel head coil were used. The standardization and optimization of the MRI 
protocol required several tests performed on a phantom. Specifically, scanner characterization and calibration sequences including BO, T1, 
and variable flip-angle mapping were implemented, optimized, and applied. After the achievement of good results in terms of quality control 
and reproducibility, we standardized and employed the same pre-contrast and dynamic T1-weighted protocols at both USC and Washington 
University sites. Of note, all the other MR sequences were identical too on both scanners. In order to minimize the inter-site variability, the 
entire MRI protocol including the anatomical and DCE pulse sequences were 100% mirrored from one site to another. To minimize inter-site 
variability, the same contrast agent Gadoterate meglumine (Dotarem®) were injected to participants at the same concentration (0.05 mmol/ 
kg). Finally, the same exact pre- and post-processing analysis pipeline was applied for both sites which includes T1 multi-FA mapping using 
linear fitting and Patlak-based DCE modeling using the arterial input function determined in each individual from the internal carotid artery. 
Applying all the above cited factors significantly limited inter-site variability. The consistency of the results from the two sites has been 
additionally confirmed by our previous publication. In brief, we performed the analysis of the combined DCE datasets from both USC and 
WashuU sites, and additionally site-specific analysis for each of the two sites separately, which showed no statistically significant differences 
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across sites. Recently, we invited a subset of 52 participants for an additional T1-weighted scan without contrast (using the same scanner and 
same MR pulse sequences) after their first DCE-MRI and measured both BO and T1 values at 2-year interval. This study showed that the results 
were unchanged and consistent across the scans, supporting minimal intra-site variability. USC and WashU DCE-MRI datasets were acquired 
and processed the same way, and then combined. 

The PET scans were also performed at two sites (USC and WashU). At USC site, a Siemens Biograph 64 PET scanner was used. At WashU site, 
FBP scans were acquired on a Siemens mMR and AV1451 scans were acquired on a Siemens Biograph mCT. The mCT session was used for 
attenuation correction of the mMR scans. Participants were injected with 300 MBq (+10%) of FBB or 370 MBq (+10%) of FBP. FBB and FBP 
images were acquired from 90 to 110 min and 50 to 70 min, respectively, after injection in accordance with the manufacturers’ 
recommendation. USC and WashU PET datasets were acquired and processed the same way, and then combined as described in the 
Methods. 

All attempts at replication for CSF, MRI, and PET analyses across sites were successful. 


Randomization We did not randomize since this was not an experimental design. 


Blinding All CSF assays, DCE-MRI and PET scans were conducted by investigators who were blinded to participant demographic information, clinical 
diagnostic status, and genotype. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used For quantitative western blot assay, the following primary antibody was used: 
- sSPDGFRB polyclonal goat IgG antibody (R&D Systems Catalog #AF1042, Lot #GOV0418041). Membranes were incubated with 
1 ug/mL of antibody overnight at room temperature, then incubated with donkey anti-goat IgG secondary antibody (Invitrogen, 
Cat #A15999, Lot #44-33-100114, 1:5,000 dilution) for 1 hour at room temperature. 


Validation Recombinant human sPDGFRB protein (R&D Systems Catalog #385-PR/CF, Lot #AMO0714072) was used as a positive control 
in validating the antibodies. Consistently, the manufacture's website specifies this antibody exhibits approximately 35% 
crossreactivity with recombinant human PDGFRB. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics All participants (n=435) underwent neurological and neuropsychological evaluations performed using the Uniform Data Set and 
additional neuropsychological tests, and received a venipuncture for collection of blood for biomarker studies. A lumbar 
puncture was performed in 350 participants (81%) for collection of cerebrospinal fluid (CSF). The dynamic contrast-enhanced 
magnetic resonance imaging (DCE-MRI) for assessment of blood-brain barrier (BBB) permeability was performed in 245 
participants (56%) that had no contraindications for contrast injection. Both LP and DCE-MRI were conducted in 172 participants. 
Among the 245 DCE-MRI participants, 74 and 96 were additionally studied for brain uptake of amyloid and tau PET radiotracers, 
respectively. 

- We studied 245 participants for MRI analysis with clinical dementia ratings of O and APOE3 genotype (n=130), 0.5 and APOE3 
(n=14), 0 and APOE4 (n=76), and 0.5 and APOE4 (n=25), who exhibited 0 cognitive domain impairment, 1, or 2+ (see Extended 
Data Table 1 for additional information regarding age, sex, and education). 

- We studied 74 participants for amyloid PET analysis with clinical dementia ratings of O and APOE3 genotype (n=45), and 0 and 
APOE4 (n=29) (see Extended Data Table 2a for additional information regarding age, sex, and education). 

- We studied 96 participants for amyloid PET analysis with clinical dementia ratings of O and APOE3 genotype (n=60), and 0 and 
APOE4 (n=37) (see Extended Data Table 2b for additional information regarding age, sex, and education). 

- We studied 350 participants for CSF sPDGFRbeta analysis with clinical dementia ratings of O and APOE3 genotype (n=157), 0.5 
and APOE3 (n=40), 0 and APOE4 (n=105), and 0.5 and APOE4 (n=48), who exhibited 0 cognitive domain impairment, 1, or 2+ (see 
Extended Data Table 3 for additional information regarding age, sex, and education). 


Recruitment Potential participants were identified from the clinics, the community, existing research cohorts or self-referred applications. 
After an initial prescreening performed by the recruiters, the study coordinators assessed and confirmed the eligibility of the 
participants before planning and scheduling the procedures and the clinical assessments related to the study. 
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The multi-modal approach for the identification of potential participants as well as the inclusion of participants from four 
different sites contributed to minimize the risk of self-selection bias. Furthermore, since the two main tests defining the grouping 
variables (i.e., the genotyping and the full neuropsychological assessment) are performed only after the enrollment of the 
participants, the bias related to the recruitment process potentially threatening the validity of the results is negligible. However, 
it should be noted that higher withdrawal rates were expected in the participants with mild cognitive dysfunction, potentially 
affecting the statistical power due to an unbalance between the cognitively normal participants and those with mild cognitive 
impairment. 


Ethics oversight University of Southern California (USC) maintains the complete program project and associated project Institutional Review 
Board (IRB) approvals reviewed and approved by USC IRB (Study IDs HS-16-00759 and HS-15-00875). All other sites also have 
local IRB approval. Banner University has IRB approval from WIRB (Study number 1173985). Huntington Medical Research 
Institute has approval from Quorum (QR# 33797/1). Mayo Clinic Arizona has approval from Mayo IRB (IRB#: 1709-97, Study ID 
PR1709-97-12). Washington University in St. Louis (WashU) has approval from the WashU Human Resources Protection Office 
(IRB ID#s: 201706100 and 201712150). 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Magnetic resonance imaging 


Experimental design 


Design type Resting state 
Design specifications N/A 


Behavioral performance measures N/A 


Acquisition 


Imaging type(s) Dynamic Contrast-Enhanced (DCE)-MRI 
Field strength 3T 


Sequence & imaging parameters From Methods section: Anatomical coronal spin echo T2-weighted scans were first obtained through the hippocampi 
TR/TE 8020/50 ms, NEX = 1, slice thickness 2 mm with 2 mm gap between slices, FOV = 175 x 175 mm, matrix size = 
448 x 448). Baseline coronal T1-weighted maps were then acquired using a T1-weighted 3D volumetric interpolated 
breath-hold sequence (VIBE) sequence and variable flip angle method using flip angles of 2°, 5°, 10°, 12°, and 15°. 
Coronal DCE-MRI covering the hippocampi and temporal lobes were acquired using a T1-weighted 3D VIBE sequence 
FA = 15°, TR/TE = 5.14/2.18 ms, NEX = 1, slice thickness 5 mm with no gap, FOV 175 x 175 mm, matrix size 320 x 320, 
voxel size was 0.550 x 0.550 x 5 mm3). This sequence was repeated for a total of 16 min with an approximate time 
resolution of 15.4 sec. Gadolinium-based contrast agent (GBCA), Gadoterate meglumine (Dotarem®, Guerbet, France) 
0.05 mmol/kg), was administered intravenously into the antecubital vein using a power injector, at a rate of 3 mL/s 
followed by a 25 mL saline flush, 30 s into the DCE scan. 


Area of acquisition Scan was coronal with slices covering hippocampi, temporal lobes and other in plane regions until the medial orbital 
frontal cortex. These regions were chosen due to their importance in aging, cognitive decline, Alzheimer's disease, and 
prior studies of blood-brain barrier dysfunction in older adults. 


Diffusion MRI Used Not used 


Preprocessing 


Preprocessing software From Methods section and Supplementary Information: 
- Blood-brain barrier permeability: Post-processing analysis was performed using Rocketship software running with 
Matlab. The arterial input function (AIF), which was extracted from a region-of-interest (ROI) positioned at the internal 
carotid artery (ICA), was fitted with a bi-exponential function prior to fitting with the Patlak model. In a few cases when 
the ICA was not clearly visible a nearby large arterial vessel was used. The Patlak linearized regression mathematical 
analysis was used to generate the BBB permeability Ktrans maps, as we previously reported. The high spatiotemporal 
resolution allowed not only simultaneous measurements of the regional BBB permeability in different white and gray 
matter regions, but also accurate calculations of the Ktrans values in small anatomical regions as thin as cortical gray 
matter areas. The present analysis requires that the tracer’s diffusion across the BBB remains unidirectional during the 
acquisition time. The total tracer concentration in the tissue, Ctissue (t), can be described as a function of the blood 
concentration, CAIF (t), the intravascular blood volume, vp, and a blood-to-brain transfer constant, Ktrans, that 
represents the flow from the intravascular to the extravascular extracellular space using equation (see Methods). A 
statistically significant intersubject variability in the measurement of vp was not observed. ROl-averaged analysis of 
DCE-MRI output maps was performed by an experienced neuroradiologist who manually drew ROls for each participant 
based on their own anatomy since a substantial variability between individuals is seen at a macroscopic level (e.g., 
enlarged ventricles, cortical atrophy, hippocampal shrinkage, etc.). Thus, the regional BBB Ktrans permeability were 
measured in 10 different gray matter ROIs including the hippocampus (HC), parahippocampal gyrus (PHG), caudate 
nucleus, thalamus, striatum, orbital frontal cortex (OFC), and inferior temporal gyrus (ITG), and white matter ROIs 
including subcortical watershed white matter fibers, corpus callosum, and internal capsule. 
- Volumetric analysis: HC and PHG morphometry were performed using the FreeSurfer (v5.3.0) software package 
(http://surfer.nmr.mgh.harvard.edu/), as previously performed. HC and PHG were segmented using FreeSurfer Desikan- 


illiany and subcortical atlases. Then, regional volumes (mm3) were derived accordingly. The technical details of this 
procedure are described in previous publications. Data processing and visualization were performed using the 
Laboratory of Neuro Imaging (LONI) pipeline system (http://pipeline.loni.usc.edu) and Quantitative Imaging Toolkit. 
Normalization /A 
Normalization template N/A 
Noise and artifact removal otion correction (for DCE-MRI) was applied using ImageJ's Stack Reg - Rigid Body plugin. 
Volume censoring /A 


Statistical modeling & inference 


Model type and settings N/A 
Effect(s) tested N/A 
Specify type of analysis: Whole brain ROI-based Both 


ROl-averaged analysis of DCE-MRI output maps was performed by an experienced neuroradiologist who 
manually drew ROls for each participant based on their own anatomy since a substantial variability 
between individuals is seen at a macroscopic level (e.g., enlarged ventricles, cortical atrophy, 
hippocampal shrinkage, etc.). Thus, the regional BBB Ktrans permeability were measured in 10 different 
gray matter ROls including the hippocampus (HC), parahippocampal gyrus (PHG), caudate nucleus, 

Anatomical location(s) thalamus, striatum, orbital frontal cortex (OFC), and inferior temporal gyrus (ITG), and white matter ROIs 
including subcortical watershed white matter fibers, corpus callosum, and internal capsule. Regions were 
chosen to cover a variety of brain regions and tissue compartments (e.g., cortical white matter, 
subcortical white matter, cortical grey matter, subcortical grey matter, and limbic areas of special focus-- 
in this case the hippocampus and medial temporal lobes). The regional brain volumes were measured in 
HC and PHC gyri. 


Statistic type for inference We did not conduct a voxel-level analysis. 
(See Eklund et al. 2016) 


Correction We did not conduct a voxel-level analysis. 


Models & analysis 


n/a | Involved in the study 


Functional and/or effective connectivity 


|__| Graph analysis 


Multivariate modeling or predictive analysis 


= 
jad) 
o 
S 
= 
o 
= 
o 
Za) 
© 
fev) 
= 
a 
a 
= 
o 
19) 
© 
oa 
5 
a 
Za) 
e 
3} 
= 
fev) 
5 
< 


Article 


Injured adult neurons regress toan 
embryonic transcriptional growth state 


https://doi.org/10.1038/s41586-020-2200-5 


Received: 12 April 2019 


Accepted: 13 February 2020 


Published online: 15 April 2020 


® Check for updates 


Gunnar H. D. Poplawski'™, Riki Kawaguchi2, Erna Van Niekerk’, Paul Lu“, Neil Mehta’, 
Philip Canete’, Richard Lie’, loannis Dragatsis’, Jessica M. Meves', Binhai Zheng", 
Giovanni Coppola?* & Mark H. Tuszynski'*~ 


Grafts of spinal-cord-derived neural progenitor cells (NPCs) enable the robust 
regeneration of corticospinal axons and restore forelimb function after spinal cord 
injury’; however, the molecular mechanisms that underlie this regeneration are 
unknown. Here we perform translational profiling specifically of corticospinal tract 
(CST) motor neurons in mice, to identify their ‘regenerative transcriptome’ after 
spinal cord injury and NPC grafting. Notably, both injury alone and injury combined 
with NPC grafts elicit virtually identical early transcriptomic responses in host CST 
neurons. However, in mice with injury alone this regenerative transcriptome is 
downregulated after two weeks, whereas in NPC-grafted mice this transcriptome is 
sustained. The regenerative transcriptome represents a reversion to an embryonic 
transcriptional state of the CST neuron. The huntingtin gene (Htt) isa central hubin 
the regeneration transcriptome; deletion of Htt significantly attenuates regeneration, 
which shows that Htt has a key role in neural plasticity after injury. 


The corticospinal tract (CST), which arises from the cerebral cortex 
and projects to the spinal cord, is the most important projection for 
voluntary movement in humans”. The failure of the CST to regenerate 
has been a major limiting factor inthe advancement of potential regen- 
erative therapies for the treatment of spinal cord injuries in humans**. 
Two experimental manipulations substantially enhance the growth of 
injured corticospinal neurons: (1) deletion of the tumour-suppressor 
genes Pten and Socs3°*, and (2) grafts of homologous neural stem cells 
or NPCs into sites of spinal cord injury!’. Deletion of Pren alone, or of 
Pten and Socs3, enables regeneration and/or sprouting only through 
small strands of host glia**, whereas NPC grafts enable robust regenera- 
tion of corticospinal axons into the lesion site itself". Untreated injured 
corticospinal axons without a growth-promoting NPC graft completely 
lack regeneration into the lesion site! (Extended Data Fig. 10a). To date, 
the intrinsic neuronal mechanisms that are recruited by neural stem 
cell grafts and enable robust corticospinal regeneration in the host 
have not been identified. 

Here we identify transcriptional mechanisms that are associated with 
corticospinal regeneration. We took advantage of the ability of NPC 
grafts to support corticospinal regeneration, together with a genetic 
mouse model (Glt25d2-eGFP-L10a mice, which express enhanced green 
fluorescent protein (eGFP)-tagged polyribosomes in layer-5b cortical 
neurons) to selectively enrich actively translated MRNA pools from 
layer-5b cortical projection neurons (including corticospinal neu- 
rons)®. We report the regenerative transcriptome of the corticospinal 
neuron. To our surprise, this regeneration state is triggered by injury 
alone, driving the corticospinal neuron to atranscriptional profile that 
is immature; NPC grafts successfully sustain the corticospinal regen- 
erative transcriptome over time, rather than recruiting anew growth 


program in neurons. We identify H¢t as an essential regulatory hub in 
the corticospinal regenerative state, and Htt-knockout mice exhibit 
substantially reduced corticospinal sprouting and regeneration. 


Three stages of CST regeneration 


It has previously been reported that grafts of spinal-cord-derived NPCs, 
orembryonic stem cells driven to neural stem cells of spinal cord iden- 
tity, support robust regeneration of corticospinal axons after spinal 
cord injury’™°. The availability of the transgenic GIt25d2-eGFP-L10a 
mouse line’ provided a tool for identifying the transcriptomic mecha- 
nisms that are associated with corticospinal regeneration. 

We performed dorsal column lesions at cervical spinal cord level 5 
(CS) to transect the corticospinal projectionin adult Glt25d2-eGFP-L10a 
mice®. One week later, we grafted eGFP-expressing NPCs derived from 
the embryonic-day-12 (E12) mouse spinal cord into the lesion (Fig. 1a, b). 
Analysis of anterogradely labelled corticospinal axons at 10, 14 and 
21 days after injury (3, 7 and 14 days after grafting) illustrated three 
anatomically distinct stages of the regeneration of corticospinal axons 
into grafts (Fig. Ic, d). (1) Inthe ‘pre-regenerative’ stage at 10 days after 
the lesion, corticospinal axons had not yet penetrated NPC grafts. The 
axons were still retracted within 1 mm of the lesion site. (2) Four days 
later, we observed an ‘early regenerative’ stage in which CST axons pene- 
trated the proximal region of the NPC graft. At this time, axons extended 
approximately 100 pm into grafts but lacked dense arborization. (3) 
Inthe ‘late regenerative’ stage at 21 days after the lesion, corticospinal 
axons extensively regenerated 0.5-1 mm into NPC grafts, arborized 
extensively and formed synapses onto grafted neurons (Fig. le, f). We 
accordingly characterized the transcriptional state at these three time 
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Fig. 1| Three distinct anatomical stages characterize the regeneration of 
corticospinal axons that is triggered by NPC grafts. a, b, Experimental 
timeline and illustration of experimental paradigm. c, CST axons labelled with 
the anterograde axonal tracer biotinylated dextran amine (BDA) have not yet 
penetrated eGFP-expressing fetal NPCs on day 10 after injury (3 days after 
grafting). By day 14 after injury (7 days after grafting), CST axons have begun to 
regenerate into the lesion site. By day 21 after injury (14 days after grafting), 
CST regeneration into grafts is extensive’. Scale bars, 100 pm. d, Quantification 
of regeneration of CST axons into the NPC graft at day 14 and day 21 after injury. 
n=3-5 biological replicates per time point, normalized to day 10, *P=0.046, 
one-way analysis of variance (ANOVA) with Tukey post hoc, mean+s.e.m. 

e, f, Arborization of regenerating CST axons (e) shows synaptophysin (SYNI) 
expression (f) when synapsing on NPC graft 21 days after injury. Scale bars, 
Spm. 


points (10, 14 and 21 days after injury), representing states of pre-, early 
and late regeneration. 


TRAP of regenerating CST neurons 


We characterized the transcriptional profile of regenerating versus 
non-regenerating adult corticospinal neurons in Glt25d2-eGFP-L10a 
mice by isolating the cellular pool of actively translated mRNAs 
using translating ribosome affinity purification (TRAP)”. In intact 
Glt25d2-eGFP-L10a mice, eGFP was expressed specifically in layer-Sb 
cortical pyramidal neurons (Extended Data Fig. 1a, b), including cor- 
ticospinal neurons (as previously reported®”). Thus, dissection of the 
motor cortex in these mice enables the identification of mRNAs that 
are highly enriched in corticospinal motor neurons: indeed, approxi- 
mately 50% at least of layer-5b pyramidal neurons in these mice are cor- 
ticospinal, identified by double labelling for eGFP and the retrograde 
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neuronal tracer cholera toxin B injected into the spinal cord at cervical 
levels (Extended Data Fig. 1c, d). mRNA that was affinity-purified from 
the motor cortex was therefore enriched in corticospinal neurons that 
project into the cervical spinal cord. We performed TRAP and RNA 
sequencing (RNA-seq) of the motor cortex at 10, 14 and 21 days after 
injury (Fig. 1a, b, Extended Data Fig. le) 


RNA-seq of regenerating CST neurons 


Gene-expression correlation analysis (Extended Data Fig. 2) showed 
that mRNA fractions isolated from eGFP-tagged polyribosomes (here- 
after referred to as ‘TRAPed’ samples) were distinct in their transcrip- 
tomic profiles from sample mRNA fractions that were not bound to 
eGFP-affinity-purified samples (hereafter, unbound samples), which 
confirmed the successful purification of TRAPed mRNA from layer-5b 
cortical neurons. The TRAPed fraction of intact mice was several-fold 
enriched in genes specifically expressed in neurons of cortical layer 
5b (as annotated by the Allen Mouse Brain Atlas”), including Fezf2", 
Parm1" and Crym>"* (Extended Data Fig. 3), and was depleted of glial 
genes as expected from an enriched neuronal sample (Extended Data 
Fig. 4). To further investigate the validity of the RNA-seq dataset, we 
confirmed that three sample genes that were differentially regulated 
upon regeneration of corticospinal axons (Sox9, Pax2 and Atg7) exhib- 
ited corresponding changes in protein expression (Extended Data 
Fig. 5). 


Regeneration prolongs the injury transcriptome 


Corticospinal regeneration is associated with the persistence of the 
neuronal injury signal: compared to intact mice, a total of 3,372 and 
1,776 genes were significantly differentially expressed in datasets from 
regenerating and lesioned control (non-regenerating) mice at the 
pre-regeneration time point (10 days after injury), respectively, using 
afalse discovery rate (FDR) of P< 0.1. There were 1,710 and 1,249 genes 
that were differentially expressed at 14 days after injury. At the late 
regeneration time point (21 days after injury), 3,037 and 365 genes were 
differentially expressed in the regenerating and lesioned control data- 
sets, respectively (Fig. 2a). Thus, regenerating corticospinal neurons 
maintained an active transcriptomic program in the presence of NPC 
grafts over the entire sampled time period (3,372, 1,710 and 3,037 dif- 
ferentially regulated genes at time points of 10, 14 and 21 days after 
injury, respectively), whereas the early rise in differentially expressed 
genes was progressively attenuated in lesioned, non-regenerating 
neurons (1,776, 1,249 and 365 genes at 10, 14 and 21 days after injury, 
respectively). 

A notable pattern that emerged from inspection of heat maps of 
these data was that many of the most-highly differentially expressed 
genes were altered in parallel in lesioned control and regenerating 
animals at the 10-day, pre-regenerative time point (Fig. 2b, Extended 
Data Figs. 6, 7). Many of these differentially expressed genes (compared 
to intact subjects) represented stem-cell-like gene families by Gene 
Ontology analysis (Fig. 3c), as well as the activation of proliferation-, 
differentiation-, and cell-cycle-progression-related functions (Fig. 2c, 
Extended Data Fig. 7). These include the upregulation of genes that have 
previously been shown to induce direct neuronal differentiation, such 
as Sox2and Ascll”. This finding suggests that one of the consequences 
of injury is the reversion or transcriptional reprogramming of the adult 
injured corticospinal neuron to a more-immature state. 

At 14 days after injury, when regeneration into grafts was beginning, 
many of the same genes that were upregulated in response to injury 
remained expressed in regenerating mice, but were beginning to lose 
expression in lesioned controls (Fig. 2b, Extended Data Fig. 6). Notably, 
one week later (21 days after injury)—when axonal regeneration was 
robust in NPC-grafted animals and failed in lesioned controls—only 
regenerating mice maintained expression of many of the same sets of 
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Fig. 2 | Transcriptomic analysis of regenerating corticospinal neurons 
reveals injury-induced reversal to an immature developmental state that is 
sustained upon regeneration. a, Number of significantly differentially 
expressed genes (DEGs) of injured (non-regenerating) and regenerating CST 
neurons relative to intact CST neurons (FDR< 0.1, Benjamini-Hochberg, 
two-sided) at days 10, 14 and 21 after injury. b, Heat map of top 1,000 DEGs 
(FDR < 0.1, Benjamini-Hochberg, two-sided) upon injury and upon 
regeneration relative to intact neurons, arranged by hierarchical clustering. 
Red, increased expression; green, reduced expression. The intensity of the 
colour reflects the degree of gene regulation. c, e-g, Pathway analysis of DEGs 
fromb. The state of activity of a signalling pathway or biological function is 
represented by az-score (right-tailed Fisher’s exact test). Red, activation; 
green, inhibition. Individual scaling parameters are listed at the bottom of each 
graph. c, Sustained activation of proliferation and differentiation pathways, 
only in regenerating CST neurons. d, Correlation analysis of DEGs between our 


genes that were activated as a function of initial injury (Fig. 2b, Extended 
Data Figs. 6, 7). 

To account for potential changes in gene expression due to physi- 
cal contact of axons with exogenous cell grafts (in the absence of 
axonal regeneration), an additional control group received grafts of 
bone-marrow stromal cells, which are non-permissive to CST regen- 
eration’. At 21 days after injury, gene expression in the bone-marrow 
stromal cell group was comparable to the lesion-only group without cell 
grafting (Extended Data Fig. 8a, b). These findings further demonstrate 
that the successful regeneration of the corticospinal system is accom- 
panied by sustained activation of transcriptional programs (Figs. 2, 3). 


Reversion to a neural development state 


Corticospinal regeneration is associated with the reversion to a devel- 
opmental state of gene expression: we compared the transcriptomic 
profiles of regenerating CST neurons toa previously published devel- 
opmental dataset of corticospinal neurons (Methods). We found a 
significant correlation (c=0.16, P< 6 x10’) between the transcriptomic 
profiles of regenerating corticospinal neurons from our regenerating 
dataset and E18 corticospinal neurons (Fig. 2d). This positive correla- 
tion was sustained throughoutall time points in the regeneration group, 
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dataset and embryonic (E18) CST neurons froma previous study (Gene 
Expression Omnibus (GEO) accession code GSE2039). Gene expression was 
ranked by log-transformed fold change for each dataset individually, and then 
correlated via rank-rank hypergeometric overlap. Correlation coefficients for 
each condition compared to the embryonic CST dataset are shown. 
Regenerating CST neurons correlate positively (red) with gene expression of 
embryonic CST neurons, whereas in the lesion-only group at 21 days we 
observed negative correlation (green). e, Sustained activation of cellular 
growth and sustained inhibition of apoptotic pathways, only in regenerating 
CST neurons. f, Increased axon regeneration is reflected in the activation of 
signalling pathways that promote axon regeneration (for example, mTOR) as 
well as inhibition of growth-inhibitory signalling pathways (PTEN). g, Synaptic 
activity and rewiring of circuits is activated through all time points of CST 
regeneration. n=2 biological replicates for day-10 non-regenerating and 

n=3 biological replicates for all other conditions. 


butinlesion-only controls shifted from a positive correlation at 10 days 
after injury (c= 0.14, P< 8 x 10°) to a negative correlation at 21 days 
after injury (c=-—0.16, P< 2 x 10°*°)(Fig. 2d). Moreover, compared to 
the strong correlation of our regenerating dataset to E18 corticospinal 
neurons, the correlation progressively diminished when compared 
to more-mature corticospinal neurons at postnatal day 3 (c= 0.11, 
P<9x10) and postnatal day 6 (c=0.10, P< 2 x 10") (Extended Data 
Fig. 8c), which suggests that the state of regeneration most closely 
correlates with an embryonic transcriptional state. This supports the 
hypothesis that injured corticospinal neurons initially revert to an 
immature, embryonic transcriptomic state, which is sustained over 
prolonged time points as regeneration proceeds. 


Unique transcriptional networks over time 

Gene Ontology analysis indicated that corticospinal regeneration was 
associated with an activated state of cellular growth and suppression 
of growth-inhibitory and apoptotic pathways (Fig. 2e). Significantly 
enriched signalling pathways involved in regeneration relate to classic 
axon growth and guidance molecules, and synaptic plasticity (Fig. 2f, g). 
Several genes (including members of the Erk family”, Mtor” and Igf1”) 
that have previously been identified as linked to regeneration were 
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Fig. 3 | Transcriptomic analysis of regenerating corticospinal neurons 
identifies Htt as a potential upstream regulator of regeneration. 

a-d, Pathway analysis was performed using all DEGs relative to gene expression 
in intact mice (FDR<0.1). The significance of a signalling pathway or biological 
function represented by a Pvalue using a right-tailed Fisher’s exact test is 
shown; darker orange denotes a higher significance. Individual scaling 
parameters are listed at the bottom of each graph. The Pvalue (right-tailed 
Fisher’s exact test) represents the significance of overlap between DEGs within 
each group and with genes classified as being involved ina particular category. 
a, The most significantly involved pathway in this dataset is axon growth and 
guidance, withincreasing involvement over time in the regenerating cohort 
and complete decline in the non-regenerating cohort. b, Genes involved in 


included among the genes that we found to be associated with regenera- 
tion (Fig. 3a), supporting the validity of this dataset. We also identified 
several additional genes that are potentially related to regeneration, 
as summarized below. 

The gene-expression profiles in the pre-regenerative (day-10) state 
that were distinct from those of intact mice included cell survival, pro- 
liferation (Fig. 2c, e), metabolism and axon guidance (Extended Data 
Fig. 8d), amongst which were Wnt genes, Ctnnb1 and genes involved 
in cAMP and PKA signalling. Genes in these datasets that were signifi- 
cantly differentially regulated included LIcam, WntSa, Wnt7b and the 
netrin receptor UncSb. We also observed upregulation of growth fac- 
tors (including /gf1 and Tgfa) and of transcription factors related to 
neuronal differentiation and axon regeneration (including Sox2, Ascl1, 
KIf3, Kif7, Satb1 and Tp53 (also knownas Trp53)) as well as downregula- 
tion of Junb, KIf2 and Atf2 (Supplementary Table 1). As noted above, 
many of these genes exhibited continued expression over the 21-day 
period during which regeneration was successful. Other genes that were 
significantly and persistently expressed with regeneration included 
proteins involved in axon guidance, suchas semaphorins (Sema3a and 
Semas5b); genes involved in axonal transport (such as KifSa, Uchi1 and 
Dlg2); and genes for cytoskeletal proteins (such as Tubb2b); as well as 
the upregulation of Nefland downregulation of Nefh (Supplementary 
Table 1). 
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synaptic plasticity become more significantly involved at 21 days after lesionin 
the regenerating cohort. c, DEGs involved in the pluripotency of stem cells are 
more significantly enriched over time as CST neurons regenerate, and less 
enriched as regeneration fails. hESC, human embryonic stem cell; mESC, 
mouse embryonic stem cell. d, Prediction of transcriptional regulators 
identifies Httas animportant potential upstream regulator of CST 
regeneration (P< 10x, right-tailed Fisher’s exact test for 21-day regenerating 
CST). e, The Htt regulatory network places Htt as aregulatory hubinthe 
regulation of genes involved in axon regeneration (for example, Bdnf, Trp53 and 
Stat3).n=2 biological replicates for day 10 non-regenerating and 

n=3 biological replicates for all other conditions. 


In the early regenerative (day-14) state, transcriptional networks 
related to protein translation, energy production and mitochondrial 
function were activated (Extended Data Fig. 8e), suggesting increased 
cellular activity to support axonal growth. We also observed the activa- 
tion of genes involved in classic regenerative pathways, suchas Mtor>”° 
(Fig. 2f, Extended Data Fig. 8e), and genes involved in PKA signalling” 
(Fig. 3a, Extended Data Fig. 8e). The activation of molecules known to 
be involved in the neuronal growth state and axon pathfinding also 
included Npy”, Rock2” and Plxnb3® (Supplementary Table 2). Specific 
individual genes differentially expressed at this time point included 
Nrp2, Map2k1 and Ptprz1 (Supplementary Table 2)—molecules that 
have previously been identified as important in central axonal regen- 
eration’ ”’, 

By the late regenerative state (day 21), we observed the upregula- 
tion of integrin genes such as /tga4, which could be involved in axon 
pathfinding; Stx12; genes for kinases, including Cdk1, Pak7 and Mapk3; 
and axon guidance molecules, including upregulation of the genes Ntn1 
and N¢f3; and downregulation of semaphorin genes, including Sema3c, 
Sema3e and Sema7a (Supplementary Table 3). Specific genes differen- 
tially expressed at this time point included K/f6, /gf2, the GABA, recep- 
tor genes, Kif5b and MyoSb (Supplementary Table 3). Voltage-gated ion 
channels, including Kcna4 and Kcna5, were also upregulated in the late 
regenerative state, possibly reflecting the establishment of functional 


synapses (Supplementary Table 3). The late regenerative phase corre- 
sponds to a stage in which regenerating axons are actively guided by 
and synapsing onneurons within the NPC grafts!*° *, and have shifted 
to asynaptogenic phase of growth (Figs. 2g, 3b, Extended Data Fig. 8f). 

Thus, successful corticospinal regeneration is characterized by at 
least two aspects of the transcriptional profile: (1) a reversion to an 
immature state, which persists as regeneration is underway, and (2) 
temporally shifting patterns of gene expression, in which anti-apoptotic 
cellular mechanisms are activated early after injury, shifting to the 
expression of genes that contribute to an active early regenerative 
state (including Pten suppression and Mtor activation” (Fig. 2f)) and 
finally—as axons have extended deeply into grafts and are forming 
synapses!*!—the expression of axon guidance molecules and synaptic 
genes (Figs. 2g, 3b). 


Httis essential for CST regeneration 


Transcriptional profiling can reveal previously unidentified mecha- 
nisms that mediate biological phenomena, such as axonal regenera- 
tion. Htt arose in our analysis as an unexpected and potentially key 
regulatory hub in axonal regeneration (Fig. 3d), among the top three 
networks that were statistically associated with regeneration. Members 
of the Htt regulatory network were enriched six-fold in cortical sam- 
ples associated with early axonal injury (P< 0.001) compared to late 
injury, and persisted at significantly elevated levels only inregenerating 
conditions (P< 0.001) (Fig. 3d, Extended Data Fig. 8e, Supplementary 
Table 4). Moreover, Htt occupied a central role as a hub in the regen- 
erating transcriptome (Fig. 3e), potentially influencing regeneration 
genes including Nfkb (also known as Nfkb1), Stat3 and Bdnf (Fig. 3e). 
Furthermore, Hithas potential functions inthe development of cortical 
projections**®, consistent with transcriptomic analyses that suggest 
that the injured or regenerating neuron reverts to an immature state. 
Although expansions of CAG trinucleotide repeats in Htt are associated 
with the development of Huntington’s disease, the typical role of Httis 
incompletely understood. Htt-knockout mice exhibit early embryonic 
lethality”, whereas the effects of conditional Htt deletion are somewhat 
controversial. One study has shown motor coordination abnormali- 
ties one month after conditional Htt deletion in adult mice’, whereas 
another study found no behavioural abnormalities up to seven months 
after conditional Htt deletion®®. Httis a regulator of Creb1®” and Tp53*° 
(Fig. 3d, e), and has previously been implicated in neurogenesis®*“, 
production and transport of BDNF”, NFKB expression® and axonal 
transport**. Given the identification of Htt as a central transcriptional 
mediator of axonal regeneration, we performed additional studies 
using mice with conditional Het deletion. 

Adeno-associated virus (AAV)8 vectors expressing tdTomato 
and Cre recombinase were injected into the right motor cortices of 
four-month-old Hté“"™ mice* (Fig. 4), a time point that could not 
affect neuronal development”. Six weeks later, we performed C4 
spinal-cord dorsal-column lesions, and grafted eGFP-expressing 
E12 spinal-cord-derived NPCs into the lesion. A separate group of 
Het" mice received AAV injections that lacked cre, followed by NPC 
grafts. Three mice that were euthanized six weeks after injection of 
AAV8-tdTomato-Cre confirmed the elimination of detectable HTT 
protein (Extended Data Fig. 9). Four weeks after graft placement, cor- 
ticospinal regeneration into grafts was reduced by 60% in mice with Htt 
deletion compared to Ht“ mice that lacked Cre injections (P<0.01, 
two-tailed t-test) (Fig. 4a-g). Thus, Htt is necessary for corticospinal 
regeneration, which identifies a role for Htt in neuronal repair. 

Inthe same mice, we also determined whether Htt deletion influenced 
spontaneous corticospinal sprouting into host grey matter above the 
lesion** *®, Indeed, Htt deletion resulted ina 50% reduction in sprouting 
at C2 after C4 spinal-cord lesions, compared to Htt®™"™ mice that lacked 
Cre injections” (P< 0.0001, two-tailed t-test) (Fig. 4h-n, Extended 
Data Fig. 10b). 
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Fig. 4| Httis essential for regeneration of corticospinal axons. Htt was 
conditionally deleted in the cortical spinal projections of the right motor 
cortex. a-d, CST axon regeneration is reduced in NPC grafts upon host Htt 
deletion. WT, wild type. Scale bars, 500 um. e, f, NeuN-positive graft areas are 
less penetrated by Htt-deficient CST axons (f) compared to Htt-positive CST 
axons (e). Scale bars, 20 um. g, Quantification of total axon length in pixel in 
graft area (normalized to individual axon counts inthe maintract, and then 
normalized to axon regeneration in the wild-type condition) showsa 
significant reduction inthe regeneration of CST axons upon host Htt deletion. 
**P= 0.0064, unpaired two-tailed t-test (n =11 wild type; n =13 Htt knockout). All 
quantifications were normalized to total axon counts inthe main CST in each 
subject. h-m, We also assessed sprouting of the CST into host cervical grey 
matter rostral to the lesion site at C2. The boxed areas inhandiareshown 
magnified inj andk, respectively; and the areas with dotted outlines inh andi 
are shown magnified inl and m, respectively. Scale bars, 200 pm (h, i); 25 um 
(j,k); 30 pm (I, m). n, Htt deletion inthe host significantly reduced sprouting of 
CST axons. ****P< 0.0001, unpaired two-tailed t-test, (n=15 wild type; n=13 Htt 
knockout). All quantifications were normalized to total axon counts inthe main 
CSTin each subject. Inthe box plots, centre line is the median, box extends 
from the 25th to the 75th percentile and whiskers denote minimum-maximum 
values. 


Discussion 


The regenerating corticospinal motor neuron transcriptome rep- 
resents a fundamental shift in state towards an immature cell, and 
requires Httto support successful repair. Notably, the transcriptional 
response to injury is nearly identical in the presence or absence of a 
regeneration-enabling cell graft, yet the pro-growth, immature cell 
state persists only when conditions permissive to growth (for example, 
astem cell implant in the lesion site) are present. 

The transcriptomic signature associated with regeneration 
evolves over time points that sample pre-, early and late regeneration 
states (Figs. 2, 3, Extended Data Fig. 8d-f). In pre-regeneration, many 
genes are pro-survival; subsequently, expressed genes have protein 
products that are associated with the axonal growth state (for example, 
mTOR) and axonal elongation (for example, cAMP, ERK and MAPK); 
finally, late regeneration genes reflect axon guidance and synaptogen- 
esis. Thus, in a two-week time frame, classical neurodevelopmental 
events occur that are associated with successful regeneration. 

Many transcriptional regulators related to corticospinal regenera- 
tion were identified in this study, including Htt. Indeed, conditional 
gene deletion showed that Htt is required for neuronal repair. Through- 
out life, neuronal maintenance and repair are essential to support 
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adequate cellular functioning; thus, Htt mutations are predicted to 
impair neuronal repair mechanisms—eventually resulting in neuronal 
degeneration. This is a testable hypothesis for future studies. 
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Methods 


No statistical methods were used to predetermine sample size. 


Study design 

We examined the regeneration transcriptome of the corticospinal neu- 
rontriggered by growth-promoting NPC grafts compared to ungrafted, 
lesioned controls. We used mice with layer-5-neuron-specific expression 
of eGFP-L10a under the Glt25d2 promoter. We pooled three mice for 
mRNA collecting per condition for each of three time points plus three 
mice for a bone-marrow stromal cell (MSC) control-graft condition. For 
in vivo studies in mice, we used asample size of 13 for Htt-wild-type mice 
and 15 for Htt-knockout mice. Numbers of replicates are listed in each 
figure legend. All experiments were randomized with regard to mouse 
enrolment into treatment group, and experimenters were blinded to 
experimental condition when performing data analyses. 


Mice 

All procedures involving mice were carried out in strict adherence 
to guidelines provided by The Guide for the Care and Use of Labora- 
tory Animals, and requirements of the institutional animal welfare 
committee. The Glt25d2 bacterial artificial chromosome (BAC) 
with eGFP-tagged ribosomal protein for TRAP (Glt25d2-eGFP-L10a) 
mice were obtained from the GENESAT project and maintained 
as previously described’. Experiments were performed on mice 
with one Glt25d2-eGFP-L10a and one wild-type C57BI/6 parent. 
Glt25d2-eGFP-L10a mice were backcrossed ten times to the wild-type 
C57Bl/6 background to avoid rejection of grafted NPCs, derived from 
either C57BL/6-Tg(CAG-GFP)10sb/J mice or from wild-type C57Bl/6 
mice. C57BL/6-Tg(CAG-GFP)10sb/] (eGFP mice) expressing eGFP CDNA 
under the control ofa chicken B-actin promoter and cytomegalovirus 
enhancer were bred with C57BL/6 (wild-type mice) female mice to 
produce eGFP-positive embryos. eGFP-mice and wild-type mice were 
obtained fromJackson Laboratory. Floxed Htt mice were generated by 
ID. (ref. *°) and backcrossed ten times to the wild-type C57BI/6 back- 
ground to avoid rejection of grafted NPCs. All mice were maintained 
ona12-hlight/dark cycle and given ad libitum access to food and water. 


Mouse surgery 

All surgery was done under deep anaesthesia using a combination of 
ketamine (80-100 mg/kg) and xylazine (5-10 mg/kg). Euthanasia for 
tissue collection was performed by injection of an overdose amount 
of anaesthesia cocktail in accordance with AVMA Guidelines for Eutha- 
nasia of Laboratory Animals. 


Dorsal column lesion 

To assess the regeneration of corticospinal tract axons, subjects 
underwent C2, C5 or thoracic (T)1 dorsal column lesions as previously 
described”. In brief, a Kopf wire knife (David Kopf Instruments) was 
inserted 0.3 mm lateral to midline and 1.1mm under the dorsal surface 
of the spinal cord; the knife was extruded 1.5 mm and lifted to transect 
the dorsal columns, with coincident compression with a 28-ga blunt 
tip from above to ensure lesion completeness. 


Transplantation surgeries 

For mouse-donor studies, grafting was performed 1 week after T1 dorsal 
column wire-knife lesion. E12 spinal cords from transgenic eGFP mice 
were dissected in ice-cold HBSS, digested in 0.25% trypsin for 20 min at 
37 °C, washed 3 times with 10% FBS in neurobasal medium, triturated 
in 5 ml neurobasal medium + B27, filtered with 70-um mesh, spun at 
200g for 5 min at room temperature, resuspended in 2 ml neurobasal 
medium + B27 and kept onice. Before grafting, cells were spun at 200g 
for 3 min at room temperature, supernatant was aspirated and cell pel- 
lets were resuspended at a concentration of 200,000 cells per plina 
fibrin matrix (25 mg/ml fibrinogen and 25 U/ml thrombin, Sigma F6755 


and T5772) containing growth factors to support graft survival’: BDNF 
(50 pg/ml, Peprotech, 452-02), neurotrophin 3 (NT-3; 50 pg/ml, 
Peprotech, 450-03), platelet-derived growth factor (PDGF-AA; 10 pg/ml, 
Sigma, P3076), insulin-like growth factor 1 (IGF-1;10 pg/ml, Sigma, 18779), 
epidermal growth factor (EGF; 10 pg/ml, Sigma, E1257), basic fibroblast 
growth factor (bFGF; 10 pg/ml, Sigma, FO291), acidic fibroblast growth 
factor (aFGF; 10 pg/ml, Sigma, F5542), glial-cell-line-derived neuro- 
trophic factor (GDNF; 10 pg/ml, Sigma, G1401), hepatocyte growth 
factor (HGF; 10 pg/ml, Sigma, H9661) and calpain inhibitor (MDL28170, 
50 uM, Sigma, M6690). Thrombin and fibrinogen cell suspensions were 
injected separately to allow for fibrin matrix formation within the spinal 
cord. Fibrinogen suspension was injected in all sites before thrombin 
suspension. The graft mixture was microinjected into the lesion cav- 
ity at Tl and into the intact spinal cord at C4 and C6 in two locations 
each. Each injection site received 0.83 pl total cell suspension, and 
subjects survived for either 3, 7 or 14 days after grafting. Lesion-only 
mice received growth factors in fibrinogen-thrombin mixture without 
cells.n=24 mice survived 3 days (9 with NPC graft, 9 without graft and 
6for histology), n=24 mice survived 7 days (9 with NPC graft, 9 without 
graft and 6 for histology) and n=33 mice survived 14 days after grafting 
(9 with NPC graft, 9 with MSC graft, 9 without graft and 6 for histology). 


Anterograde corticospinal tracing 

Nine days before perfusion, corticospinal tract axons were antero- 
gradely labelled by injection of 0.3 pl of 10% BDA (MW 10,000, Molecu- 
lar Probes) into each of 10 sites per hemisphere spanning the motor 
cortex (from bregma, rostral—caudal: +0.0, 0.5, 1 mm; lateral: +1.2, 
2.2 mm; depth: 0.7 mm). 


Retrograde corticospinal tracing 

To colabel CST neurons projecting into the cervical spinal cord that 
also express the BAC-eGFP-L10a transgene, we injected 0.2% cholera 
toxin B (CTB) in PBS bilaterally into the cervical (C4-C8) grey matter of 
Glt25d2-GFPL10a mice (n= 2). Each mouse received a total of 8 injections 
of CTB at a volume of 300 nl per site, spanning 4 sites rostrocaudally 
(-0.5 and +0.5 mm lateral from midline). Mice were perfused three days 
after injection and cortices were processed for histological assessment. 


Htt-knockout experiments 

AAV8 vectors expressing tdTomato and Cre recombinase 
(AAV8-tdTomato-Cre) were injected into the right motor cortices of 
four-month-old Ht“ mice’ (n = 15) (Fig. 4), a late time point that 
would not affect neuronal development”. Six weeks later, we per- 
formed C4 spinal-cord dorsal-column lesions to transect the corti- 
cospinal projection, and grafted eGFP-expressing wild-type-mouse 
E12 spinal-cord-derived multipotent NPCs from the same mouse 
background into the lesion site; we hypothesized that corticospinal 
regeneration would be reduced in these mice owing to Htt deletion. A 
separate group of He" mice (n = 13) received motor cortex injec- 
tions of AAV8 vectors expressing tdTomato but not Cre, followed 6 
weeks later by grafts of the same eGFP-expressing wild-type mouse 
E12 spinal-cord-derived multipotent NPC grafts to C4 spinal-cord 
dorsal-column lesions; we predicted that this group would exhibit 
corticospinal regeneration, as previously reported’. AAV8 viral vec- 
tors (Salk Institute Viral Vector Core) expressing either TdTomato 
(Htt** mice, n=13) or TdTomato plus Cre recombinase (Htt“ mice, 
n=15) transgenes under control of the CAG promoter were injected 
into forelimb and hindlimb areas of the right primary motor cortex 
and primary somatosensory cortex of Htt’"* mice at a concentra- 
tion of 5 x 10” viral particles per ml, volume 0.5 pl per site, 6 sites per 
side of the brain*® (coordinates from bregma: anterior-posterior 1 to 
—1 mm; medial-lateral 1.5 to 2.5 mm; depth 0.7 mm). Six weeks later, 
mice underwent a dorsal column lesion at C4 followed by immediate 
grafts of eGFP-expressing fetal spinal-cord-derived NPCs. Four weeks 
after grafting, mice were processed for histology. (n=15 Htt knockout; 
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n=13 Htt wildtype). Regenerating axons were quantified via the freely 
available Image] plugin, AxonTracer®. 


Fluorescent labelling 

Spinal cords or brains were removed from the vertebrae and skull and 
serially sectioned in the sagittal or coronal plane at 35-ym intervals. 
Every sixth section was used for antibody labelling. All steps were per- 
formed at room temperature unless noted otherwise. Sections were 
washed 3 times in TBST (0.025% triton-X100 in TBS) for 10 min, washed 
with 50% methanol in TBS for 5 min, dehydrated in 100% methanol for 
15 min, washed with 50% methanol in TBS for 5 min and blocked in TBST 
+5% donkey serum for Lh, and then incubated overnight in TBST + 3% 
donkey serum with primary antibodies at 4 °C. Sections were washed 
3 times in TBST + 3% horse serum and incubated 2.5 h with secondary 
antibodies, then washed 3 times with TBS for 10 min, mounted on glass 
slides and coverslipped Fluoromount G (Southern Biotechnology). 


Primary and secondary antibodies for fluorescent 
immunohistochemistry 

Rabbit anti-GFP (1:1000, A-11122, Invitrogen) with Alexa-488-conjugated 
donkey anti-rabbit (1:200; Jackson ImmunoResearch); rabbit 
anti-RFP-biotin (1:5,000; ab34771; Abcam) with Alexa-594-conjugated 
streptavidin (1:300); mouse anti-NeuN (1:250, MAB377, Millipore) with 
Alexa-647-conjugated donkey anti-mouse (1:200; Jackson ImmunoRe- 
search) for Fig. 3 and with Alexa-594-conjugated donkey anti-mouse for 
Supplementary Fig. 1; mouse anti-CTB (1:1,000, C86204M, Meridian 
Life Sciences) with Alexa-594-conjugated donkey anti-mouse; goat 
anti-SOX9 (1:1,000, AF3075, R&D Systems) and Alexa-594-conjugated 
donkey anti-goat; rabbit anti-PAX2 (1:500, 71600, Life Technology) 
and Alexa-594-conjugated donkey anti-rabbit; rabbit anti-ATG7 (1:500, 
LS-c156613, LS Bio) with Alexa-594-conjugated donkey anti-rabbit; rabbit 
anti-Cre (1:1,000; 908001, Biolegend) with Alexa-488-conjugated don- 
key anti-rabbit; nuclei were stained with 4’,6-diamidino-2-phenylindole 
dihydrochloride (DAPI) at 1 pg/ml (Sigma-Aldrich). 


Light-level HTT labelling 

PFA fixed cortical sections were incubated in sodium citrate buffer 
(10mM sodium citrate, 0.05% Tween 20, pH 6.0) for 10 min at 90 °C; 
washed twice for 5 min with TBS; quenched with 0.06% H,O, in TBS for 
20 min; washed twice for 2 min in TBS; blocked for 1h (M.O.M. mouse 
IgG blocking reagent in1 ml TBS), washed twice for 2 minin TBS; incu- 
bated in diluent for 5 min (400 ul of protein concentrate stock solution 
in 5 ml TBS); incubated with mouse HTT (1:100) MAB2166 in diluent 
overnight at room temperature, washed 3 times for 10 min eachin TBS 
incubated with M.O.M. biotinylated anti-mouse IgG reagent for 2.5 h; 
washed 3 times for 10 min each in TBS; incubated in VECTASTAIN Elite 
ABC HRP Kit (Vector Laboratories) for 30 min; washed 3 times for 10 min 
each in TBS; incubated in biotinyl tyramide (1:2,500) in 30% H,O, in 
TBS for 30 min.; washed 3 times for 10 min each in TBS; incubated with 
ABC (9 pI +9 plin PBS) for 1h; washed 3 times with TBS for 10 min each 
and stained with DAB; dehydrated with increasing concentrations of 
ethanol, followed by isopropanol and citrisolve for 2 min each. 


RNA isolation and sequencing 

We performed dissections of the motor cortex in a total of 63 
Glt25d2-eGFP-L10a mice (3 mice pooled per sample). Three intact 
samples (pooled from 9 mice) were studied, and a total of 18 lesioned 
samples (pooled from 54 mice) were studied at time points of 10, 14 
and 21 days after injury. Experimental time lines and procedures are 
illustrated in Fig. 1a, b and Extended Data Fig. le. Motor cortices were 
dissected, eGFP-tagged polyribosomes were affinity-purified via eGFP- 
antibody-protein-L complexes using streptavidin-labelled magnetic 
beads, and separate bound and unbound RNA fractions (consisting 
of CST neurons and non-CST material, respectively) were isolated””°. 
RNA was collected using the Absolutely RNA Nanoprep Kit (Agilent) 


according to the manufacturer’s instructions. Samples were shipped 
ondryiceto the University of California Los Angeles RNA-seq core for 
RNA quality control and RNA-seq onan Illumina HiSeq 2500 sequencer. 
Total RNA integrity was examined using the Agilent Bioanalyzer 2000 
(Agilent) and quantified with NanoDrop (Thermo Fisher Scientific). All 
but one sample passed RNA quality assessment; one sample from the 
lesion control group at 10 days after injury was excluded. Total RNA 
(20 ng) was used to generate cDNA using Ovation RNA-Seq System V2 
(NUGEN) following the manufacturer’s instructions. cDNA (100 ng) was 
used inthe library preparation using Ovation Ultralow Library Systems 
(NUGEN). The cDNA was fragmented to 300 base pairs (bp) using a 
Covaris M220 (Covaris), and then the manufacturer’s instruction was 
followed for end repair, adaptor ligation and library amplification. All 
samples were multiplexed into a single pool to avoid batch effects” and 
sequenced using an Illumina HiSeq 2500 sequencer (Illumina) across 
4 lanes of 69-bp paired-end sequencing, corresponding to 3 samples 
per lane and yielding between 31 and 69 (average of 42) million reads 
per sample. Alignment to the Mus musculus (mm10) refSeq (refFlat) 
reference gene annotation was performed using the STAR-spliced read 
aligner (v.2.3.0) with default parameters. Data quality was assessed on 
base-quality calls, nucleotide composition of sequences, insert sizes, 
per cent of uniquely aligned reads and transcript coverage using cus- 
tom scripts (available at https://github.com/icnn/RNAseq-PIPELINE) 
and picard-tools-1.118 (http://broadinstitute.github.io/picard/). Read 
alignments were visualized using the Integrative Genomics Viewer™. 
Sixty-four to seventy-seven per cent (average of 71%) of the reads 
mapped uniquely to the mouse genome. Total counts of read frag- 
ments aligned to candidate gene regions were derived using the HTSeq 
program (https://htseq.readthedocs.io) with mouse mm10 (Dec.2011) 
refSeq (refFlat table) as a reference and used as a basis for the quantifi- 
cation of gene expression. Only uniquely mapped reads were used for 
subsequent analyses. Differential expression analysis was conducted 
with R-project and the Bioconductor package edgeR (v.3.14.0)*. Sta- 
tistical significance of differential expression was determined at FDR 
<10% (q< 0.1). Ingenuity Pathway Analysis software (Qiagen) was used 
for data analysis (the complete gene list was used). 


Correlation with a previously published developmental dataset 
We applied ranked gene expression by log-transformed fold change 
to each dataset individually. This was derived by comparing our 
non-regenerating and regenerating data to our intact dataset, and by 
comparing embryonic day 18 (E18) CST neurons to postnatal day 14 
(P14) CST neurons in the previously published dataset'®. We then cor- 
related both datasets via rank-rank hypergeometric overlap». 


Statistical analysis 

All bar graphs are presented as the mean + s.e.m. For quantification, 
an unpaired, two-tailed Student’s t-test was used. For RNA-seq analy- 
sis, an FDR of 10% (FDR q < 0.1) was applied; a statistical significance 
criterion of *P< 0.05, *P< 0.01, ***P< 0.001, ****P< 0.0001 was used in 
all other experiments. For Fig. 1d, one-way ANOVA with Tukey post hoc 
test was used; for Fig. 2d, Extended Data Fig. 9, we applied ranked gene 
expression by log-transformed fold change to both datasets individu- 
ally. Experimenters were blinded to group identity when conducting 
experiments and performing analyses. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


RNA-seq data have been deposited in the Gene Expression 
Omnibus (GEO) repository (www.ncbi.nlm.nih.gov/geo), accession 
number GSE126957. 
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Extended Data Fig. 1| TRAP of regenerating corticospinal neurons. 

a, b, Coronal cortical section of transgenic BAC-TRAP mice expressing eGFP- 
L10A under the GIt25d2 promoter primarily in neurons (NeuN) of cortical layer 
5b. b, Confocal z-stack projection of 15-m thickness showing colocalization of 
eGFP-L10A and NeuN within pyramidal neurons. c, d, Retrogradely labelled 
CST neurons (CTB) projecting to the cervical spinal cord (C4-C8) grey matter 
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colabel with eGFP-L10A, indicating that about 50% of TRAPed mRNA 
molecules are derived from corticospinal neurons. d, Confocal z-stack 
projection of 10-pm thickness showing colocalization of eGFP-L10A and CTB 
within pyramidal neurons. n=3 biological replicates, which showed 
comparable results. e, Experimental timeline for purification of TRAPed mRNA 
molecules exclusively from cortical-layer-5 neurons. 
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Extended Data Fig. 2 | Sample Pearson correlation identifies unique RNA collected from the motor cortex (unbound samples), indicating that TRAP 
populations of mRNAs derived from CST neurons (TRAPed samples) and enriched for a uniquely defined mRNA population that is distinct from whole 
from whole motor cortex (unbound samples) after TRAP. mRNA species cortex samples. Samples are arranged by hierarchical clustering. 

from all conditions are more similar to each other (TRAPed samples) than total 
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Extended Data Fig. 3 | Insitu hybridization data from the Allen Mouse Brain 
Atlas confirms layer-5b-specific mRNA expression of genes highly 
enriched by TRAP from intact Glt25d2-eGFP-L10a mice. a, Fold change of 
most highly enriched mRNA transcripts in the corticospinal cell population 
(TRAPed samples) compared to mRNA from whole motor cortex (unbound 
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samples) inthe intact state. n= 3 biological replicates per condition. b, Insitu 
hybridization (ISH) data (Allen Mouse Brain Atlas; https://mouse.brain-map. 
org/) of most highly enriched genes (froma) confirms cell-type-specific 
enrichment of TRAPed mRNA expression in layer-5b neurons of the adult 
mouse motor cortex. 
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Extended Data Fig. 4| Neuronal genes are enriched in the TRAPed samples 
from layer-5b neurons. a, Neuronal marker genes are enriched in the TRAPed 


samples derived from layer-5b neurons. b, Astrocyte- and 
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motor cortex (unbound samples). n=3 biological replicates per condition. 
Data are mean +s.d. 
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Extended Data Fig. 5| Verification of RNA-seq data via immunolabelling for 
ATG7, SOX9 and PAX2. Immunohistological staining of selected proteins in 
layer 5b of the motor cortex of Glt25d2-eGFP-L10a transgenic mice. a, b, eGFP- 
L10A staining is shown in green and SOX9 staining is shown in red in intact mice 
(a) or inregenerating CST 10 days after injury with E12 NPC graft (b).c, Sox9 
mRNA expression after TRAP and RNA-seq. Data are normalized to those from 
intact mice. ****P=3.2 x10”, two-tailed t-test. d,e, eGFP-L1OA staining is shown 
in green and PAX2 staining is shown in red in intact mice (d) or in regenerating 
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CST 10 days after injury with E12 NPC graft (e). f, Pax2 MRNA expression after 
TRAP and RNA-seq. Data are normalized to those from intact mice. **P=0.0057, 
two-tailed t-test. g, h, eGFP-L1OA staining is shown in green and ATG7 staining 
is shown in red in intact mice (g) or inregenerating CST 10 days after injury with 
E12 NPC graft (h). i, Atg7 MRNA expression after TRAP and RNA-seq. Dataare 
normalized to those from intact mice. ****P=3.0 x10 °, two-tailed t-test. 

n=3 biological replicates per condition. Dataare mean +s.d. 
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Extended Data Fig. 6| Heat map of the top 1,000 significantly differentially 
regulated transcripts (FDR< 0.1) uponinjury and uponregeneration, 
relative to intact mice, with individual gene names. Heat map of the 

top 1,000 significantly differentially regulated transcripts (FDR<0.1, 
Benjamini-Hochberg, two-sided) upon injury and upon regeneration, relative 
to the intact state. Red, increased expression; green, reduced expression. The 


intensity of the colour reflects the degree of gene regulation compared tothe 
intact state. Transcripts are arranged by hierarchical clustering. n=2 biological 
replicates for day-10 non-regenerating, and n=3 biological replicates for all 
other conditions. Genes are listed from top to bottom. Black and grey boxes 
next to heat map indicate gene lists (columns 1to7 from left to right). 
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Extended Data Fig. 7 | See next page for caption. 


Extended Data Fig. 7 | Heat map of genes enriched in the Gene Ontology 
analysis (Ingenuity Pathway Analysis software). Heat map of Gene Ontology 
(GO)-term-associated significantly differentially regulated transcripts 

(FDR < 0.1, Benjamini-Hochberg, two-sided) upon injury and upon 
regeneration, relative to the intact state, at 10, 14 and 21 days after injury. Red, 
increased expression; green, reduced expression. The intensity of the colour 
reflects the degree of gene regulation compared tothe intact state. Transcripts 
are arranged by GO-term clustering (Ingenuity Pathway Analysis software). GO 
clusters listed on the right are indicated via colour code on the left side of the 


heat map. Many common genes have increased in expression in both 
non-regenerating and regenerating groups 10 days after injury. However, by 
day 14, these injury-induced changes are diminishing in the absence of agraft 
but are sustained in the presence of a graft, into which regeneration occurs. By 
day 21 after injury, many genes have returned to baseline levels in 
non-regenerating CST neurons, whereas they remain highly upregulated in 
regenerating CST neurons. The GO categories are derived from Figs. 2, 3. 
n=2biological replicates for day-10 non-regenerating, and n=3 biological 
replicates for all other conditions. 


Article 


A B 


Non-Regenerating Regenerating 
Day 21 Day 21 


MSC Graft -148 | 7) 
1371 I 1655 nec cratt 


2000 -1000 0 1000 2000 


noGraft 249 |] 16 


2000 -1000 0 1000 2000 


Number of sig. differentially expressed 
genes compared to intact 


Non-| -Regenerating Regenerating 
no NPC 


graft graft 
21d_ 21d 


D E 


Overlapping signaling pathways of regenerating 
vs. non-regenerating CST neurons (Day 10) 


Overlapping signaling pathways of regenerating 
vs. non-regenerating CST neurons (Day 14) 


Cc 


Developmental CST dataset (Macklis) 
E18 P3 P6 


ey 10 day (eot9) (0.003) (ove) 
i 04 = 
c 
© 14day fee ss » I 
c iy (62-29) (20-09) Gece) | ra 
a o€ 
(J) fo} 
Uv 
eS 21 day 70.28) (e043) ot | 5 
a= 
D> 7 
& 10day ee 2 Ss 5 
oO is) 
5 2s 
c fe} 
o l4day 3 (0.09) oa, £ 
Dn | o 
Z | ys 
H 
| 
3 21day  @s (te-05) “en | 
Zz 
Correlation regenerating/developmental 
Overlapping signaling pathways of regeneratin 


vs. non-regenerating CST neurons (Day 21) 


ess a ae ee ee me ee 
/ Pathways involved in ‘ iG Gluconeogenesis_ pathways involved in Pathways involved 
! Cellular Proliferation and Growth I sucrose - | Ox. Energy Production I in Synaptic u 
[Ovarian pax Glu-dependent ! 4 Deg. i | mx. horylati ErbB Signaling I 
| I | josphorylation =Thrombi 
| Cancer fe acid resistance ‘ia ee TCA Cycle ae HN GPCR 4 
1 Se __lMeaea ! Bhaagepitie Maturation Remodeling u 
l ons: Receptor ! Mitochondrial Adherens J 7 : cai 
I pluripotency 7 ‘ I Dysfunction Juncrions = I 
= xon) Adherens Ccaesiceniteaiin’ aq 
I, Gai ~ fidanes | Ephrin cum SZ - a aherens / Ze ! 
I ancer = saa : I { NeuropathicZZZ LTP ! 
I /S= Ephrin-B \ I Mistathmin 1 | Pain I 
I (iqgColorectal Cancer <Q I c guy Renal Cell Taxon ™m J 
N Metastasis | Mil Eph P a Carcinoma I Guidan: ln oe 7 
— a a a = B-adrenergic. /,"') Lig er l 
== ae pagan ye WP AN Micirca ian 
‘ - * PC 17 Tar Rhythm I cAMP f 
1 Estrogen- aay Z Cellular Effects 
‘Melatonin I kg inti 1 I Rho “En: i i 
i as | Receptor inscription I I AxonG wih bo of Sildenafil 
ji on Gro’ 
I (late st) Sp RXR I inv el \ ee ind Translation , \ 5 i a castrin 
Bi I NG a and Guidance d i 
I ier I Inhib. | Uracil ——e ee ee ee ¥< 7 B-adrenergic 
- ee oe 
| Dermatan Dermatan (late st.) I | Degradation of; p-Value 1E-2| ee 1E-4 => 
\ Biosynthesis of Sulfates ‘Molecules # p-Value  1E-4| 1-13 
p-Value 16-2) 1-8. 


Extended Data Fig. 8| The regeneration gene-expression profile is 
sustained at 21 days only with an NPC graft, and not with anon-stimulatory 
MSC graft, and correlates with neuronal developmental transcriptomes. 

a, Number of significantly differentially expressed genes of non-regenerating 
(no graft and MSC graft) and regenerating (NPC graft) CST neurons relative to 
intact CST neurons (FDR<0.1, Benjamini-Hochberg, two-sided) at 21 days after 
injury. Red, increased expression; green, reduced expression. b, Heat map of 
thetop 1,000 significantly differentially regulated transcripts (FDR< 0.1) upon 
injury and upon regeneration, relative to the intact state, at 21 days after injury. 
Red, increased expression; green, reduced expression. The intensity of the 
colour reflects the degree of gene regulation compared to the intact state. 
Transcripts are arranged by hierarchical clustering. The non-regenerating 
conditions (no graft and MSC graft) show gene-expression patterns similar to 
one another; and, in both cases, gene expression is not sustained incomparison 
to the regenerating condition (NPC graft). n =3 biological replicates. 

c, Differentially expressed genes (FDR < 0.1) of a previous neuronal 
development study (GSE2039) for different time points against P14 


(log-transformed fold change) was compared to CST data against the intact 
condition. Numbers indicate the Pearson correlation coefficient of the two 
datasets. Colour indicates the magnitude of the correlation coefficient (side 
bar). Values in parentheses are statistical significance (P values) of the 
correlation coefficients. n=2 biological replicates for day-10 non-regenerating 
and n=3 biological replicates for all other conditions. d-f, Overlapping 
signalling pathways of regenerating versus non-regenerating CST neurons 
(FDR <0.1) using Ingenuity Pathway Analysis software at 10 (d), 14 (e) and 21 (f) 
days after injury. The network of overlapping signalling pathways shows each 
pathway asa single ‘node’ that is coloured proportionally to the right-tailed 
Fisher’s exact test P value; darker red, more significant. Individual scaling of 
Pvalues is listed at the bottom of each graph. A line connects any two pathways 
whenthereis at least one dataset molecule incommon between them. 
Pathways of similar function are clustered together (dashed blue outlines). 
n=2biological replicates for day-10 non-regenerating and n =3 biological 
replicates for all other conditions. 
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Extended Data Fig. 9 | AAV8-tdTomato-Cre expression completely reductionin HTT protein (black or blue) inthe infected area.d,e, 

abolishes detection of HTT protein six weeks after injection. High-magnification images showing cortical neurons with baseline HTT 
AAV8-dependent Cre delivery to Htt"*“"" mice was used to conditionallyknock —_ protein expression outside of infected areas (d) and after HTT deletionin 
out HTT expression in corticospinal neurons six weeks before injury. a-c, Low cortical neurons that are transfected with AAV8-tdTomato-Cre (e). 
magnification of right motor cortex six weeks after AAV8-tdTomato-Cre f, AAV8-tdTomato-Cre-infected cortical neurons co-express tdTomato (red) 
injection indicates successful tdTomato (red) expression in motor cortex and and Cre (green). n=3 biological replicates, with comparable outcomes. 
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Extended Data Fig. 10| Lesioned corticospinal axons do not regenerate in matter underlying the lesion, consistent with previous reports** *®. Inthe box 
the absence ofan NPC graft. a, CST axons labelled with the anterograde plot, centre line is the median, box extends from the 25th to the 75th percentile 
axonal tracer BDA do not regenerate in the absence of a growth-stimulating and whiskers denote minimum-maximum values. n=8 biological replicates. 
NPC graft. The GFAP label indicates glial activation around the lesion site. **“P= 0.0027, two-tailed, non-paired test. 


b, However, CST axons in mice without grafts do sprout into spared host grey 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size For TRAP analysis we combined n=3 mice per group to harvest enough mRNA in good quality for RNA-sequencing. Every time-point had n=3 
biological replicates as is standard for RNAseq analysis. 
For HTT-KO experiments : 
A sample size of 11 subjects/group was determined by power analysis: 12 animals per group, standard deviation of 0.15, and effect size of 
0.24 yields computed power of 0.85. This design is sufficient to determine significant group differences .We thus used at least n=15 subjects 
per group to account for potential issues such as incomplete graft integration. 


Data exclusions We excluded one replicate of the non-regeneration group at 10 days post lesion since the mRNA from this sample did not pass quality 
assessment. 
For the assessment of regeneration into the NPC graft we excluded 4 animals in the group that received AAV-TdTomato and 2 animal in the 
group that received AAV-TdTomato-Cre due to poor graft survival. 
The exclusion criteria was pre-established. The NPC graft has to fill the complete lesion cavity and cells have to differentiate to 80% into NeuN 
positive cells. 


= 
red) 
a 
ie 
= 
o 
= 
O 
Za) 
© 
fed) 
a 
@ 
=F 
= 
O 
18, 
fe) 
= 
S 
2) 
a 
Wn 
(S 
= 
= 
jad) 
5 
< 


Replication For TRAP analysis every experiment was performed in triplicates. all experiments were successful except 1 replicate of the non-regeneration 
group at 10 days post lesion did not pass quality assessment. 


Randomization Mice for TRAP were randomly assigned into groups to mix gender and age in every group of pooled animals for the different time-points 
investigated. 


Blinding Investigators were blinded by randomization to the analysis of HTT-floxed mice that received either AAV-cre-tdTomato or AAV-tdTomato 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used rabbit anti-GFP (1:1000, A-11122, Invitrogen) with Alexa 488-conjugated donkey anti-rabbit (1:200; Jackson ImmunoResearch, 
West Grove, PA); rabbit anti-rfp-biotin (1:5000; ab34771; Abcam) with Alexa 594-conjugated Streptavidin (1:300); mouse anti- 
NeuN (1:250, MAB377, Millipore) with Alexa 647-conjugated donkey anti-mouse (1:200; Jackson ImmunoResearch, West Grove, 
PA) for Fig.3 and with Alexa 594-conjugated donkey anti-mouse for Fig.S1; mouse anti-CTB (1:1000, C86204M, Meridian Life 
Sciences) with Alexa 594-conjugated donkey anti-mouse; goat anti-Sox9 (1:1000, AF3075, R&D Systems) and Alexa 594- 
conjugated donkey anti-goat; rabbit anti-Pax2 (1:500, 71600, Life technology) and Alexa 594-conjugated donkey anti-rabbit; 
rabbit anti-ATG7 (1:500, LS-c156613, LS Bio) with Alexa 594-conjugated donkey anti-rabbit; rabbit anti-cre (1:1000; 908001, 
Biolegend) with with Alexa 488-conjugated donkey anti-rabbit; nuclei were stained with 4’,6-diamidino-2-phenylindole 
dihydrochloride (DAPI; [1 ug/mL]; Sigma-Aldrich, St. Louis, MO). ms-Htt (1:100, AB2166, Millipore) 


Validation A-11122 validated by PMID: 30429769 for IHC 
Ab34771 validated by PMID: 30948479 for IHC 
AB377 validated by PMID: 30948479 for IHC 
C86204M validated by PMID: 28361990 for IHC 
AF3075 validated by PMID: 28336567 for IHC 
7 
C 


7 12q012Q 


1600 validated by PMID: 30948479 for IHC 
156613 validated by supplier for IHC 
908001 validated by PMID: 27462817 for IHC 
AB2166 validated by PMID: 21994396 for IHC 


810 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals mice, C57Bl/6, mixed gender (age 6 to 12 month) 
mice, C57BI/6-HTT-flox-flox, mixed gender (age 6 to 12 month) 
mice, C57Bl/6-Glt25d2-eGFP-L10a, mixed gender (age 6 to 12 month) 


Wild animals The study did not invovle wild animals 
Field-collected samples The study did not contain field-collected samples 
Ethics oversight All procedures involving animals were carried out in strict adherence to guidelines provided by The Guide for the Care and Use of 


Laboratory Animals (The Institute of Laboratory Animal Resources, 2011), and requirements of the institutional animal welfare 
committee. National Institutes of Health guidelines for laboratory animal care and safety were strictly followed. All animal 
procedures were approved by the Institutional Animal Care and Use Committee of the Department of Veterans Affairs (VA) San 
Diego Healthcare System. Animals had free access to food and water throughout the study. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Photoreceptor loss is the final common endpoint in most retinopathies that lead to 
irreversible blindness, and there are no effective treatments to restore vision’. 


Chemical reprogramming of fibroblasts offers an opportunity to reverse vision loss; 
however, the generation of sensory neuronal subtypes such as photoreceptors 
remains a challenge. Here we report that the administration of a set of five small 
molecules can chemically induce the transformation of fibroblasts into rod 
photoreceptor-like cells. The transplantation of these chemically induced 
photoreceptor-like cells (CiPCs) into the subretinal space of rod degeneration mice 
(homozygous for rd1, also known as Pde6b) leads to partial restoration of the pupil 
reflex and visual function. We show that mitonuclear communication is a key 
determining factor for the reprogramming of fibroblasts into CiPCs. Specifically, 
treatment with these five compounds leads to the translocation of AXIN2 to the 
mitochondria, which results in the production of reactive oxygen species, the 
activation of NF-KB and the upregulation of Ascl1. We anticipate that CiPCs could have 
therapeutic potential for restoring vision. 


Many retinopathies—such as age-related macular degeneration, 
diabetic retinopathy and retinitis pigmentosa—ultimately result in 
the loss of retinal neurons, which leads to irreversible vision loss!. 
Stem-cell therapy, using embryonic stem cells or induced pluripo- 
tent stem cells, is a promising strategy to replace lost retinal cells and 
improve vision**. However, protocols for the derivation of candidate 
replacement cells are cumbersome and time consuming, presenting 
achallenge for their use in clinical therapy* ®. 

Direct reprogramming—using ectopic transcription factors and 
chemicals—bypasses the requirement for pluripotent cells and has 
resulted in the generation of neurons, astrocytes and cardiomyo- 
cytes; however, the pharmacological conversion of photorecep- 
tors using this method has not been realized’ °. ASCL1, a powerful 
proneural transcription factor, has been reported to reprogram 
glial cells into photoreceptors” *. An improved mechanistic under- 
standing of direct reprogramming may lead to the generation of 
new cell types. 

Here we identify a set of five small molecules that can induce fibro- 
blasts to become functional CiPCs without the need for pluripotent 
cells or viral transcription factors. We demonstrate that CiPCs restore 
pupil reflex and vision when transplanted into the subretinal space 
of rd1 mice, a mouse model of retinal degeneration. Moreover, our 
mechanistic analysis reveals an AXIN2—-NF-kKB-ASCL1 pathway that 
promotes retinal lineage during reprogramming and identifies 
mitochondria as a signalling hub in the orchestration of cell fate 
conversion. 


Aset of five compounds transforms fibroblasts to 
CiPCs 


To generate CiPCs, we used mouse embryonic fibroblasts (MEFs) 
derived from a transgenic Nr[-GFP mouse, in which the Nr/ promoter 
drives expression of eGFP specifically in rod photoreceptors””"*. We 
began by testing an established combination of four small molecules— 
valproic acid (V), CHIR99021 (a GSK3 inhibitor) (C), RepSox (R) and for- 
skolin (F), together denoted VCRF—that is known to convert fibroblasts 
into neurons, but only a few of the resultant cells were Nrl-GFP posi- 
tive”. Various small molecules and culture conditions were attempted 
(Supplementary Table 5), and we found that the Wnt/B-catenin pathway 
inhibitor IWR1 (1)—in combination with VCRF and STR (Sonic hedge- 
hog, taurine and retinoic acid)—was able to substantially improve the 
efficiency of conversion of MEFs into Nri-GFP* cells (Fig. 1a, b). STR was 
added on day 8 of reprogramming, to promote and support the forma- 
tion of photoreceptors after photoreceptor-specifying transcription 
factors—such as RORG, ASCL1, PIAS3—were upregulated’ ~ (Fig. 1I, 
Supplementary Tables 1, 2). Similarly, human adult dermal fibroblasts 
(HADF), and human fetal lung fibroblasts transduced with a Nrl-DsRed 
reporter, were also converted into CiPCs (Fig. 1f-h, j, Extended Data 
Fig. 1a, b, Supplementary Fig. 2, Supplementary Table 6). We next tested 
the small molecules individually and in various combinations, and 
observed that they failed to generate as many Nr/-GFP* cells (Extended 
Data Fig. 1c, d). We conclude that all five compounds in combination 
can efficiently convert fibroblasts into NRL-expressing CiPCs. 


‘Department of Pharmacology and Neuroscience, Laboratory for Retinal Rehabilitation, North Texas Eye Research Institute, University of North Texas Health Science Center, Fort Worth, TX, 
USA. ’Neurobiology, Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, MD, USA. “Department of Pharmacology and Neuroscience, 
University of North Texas Health Science Center, Fort Worth, TX, USA. “Nanoscope Technologies LLC, Bedford, TX, USA. ‘Department of Ophthalmology, University of North Carolina at 
Chapel Hill, Chapel Hill, NC, USA. °EyeCRO LLC, Oklahoma City, OK, USA. ’CIRC Therapeutics, Inc., Dallas, TX, USA. *These authors contributed equally: Koray Dogan Kaya, Yan Fan. 
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Fig. 1| Conversion of fibroblasts and the molecular characterization of 
CiPCs.a, Protocol for the reprogramming of mouse fibroblasts into CiPCs. MC, 
medium change; PIM, photoreceptor induction medium; PDM, photoreceptor 
differentiation medium. b, Images of CiPCs expressing Nri-GFP on day 11land 
day 16. Scale bars 14.4 mc, Images of CiPCs expressing CRX on day 11. Scale 
bars 14.4 umd, FACS purification of reprogrammed Nrl-GFP* CiPCs (0.2%). 

e, PCR with reverse transcription reveals the expression of the indicated 
photoreceptor-specific genes in mouse. For gel source data, see 
Supplementary Fig. 2a, b. f, Protocol for the reprogramming of HADFsinto 
CiPCs.g, Quantitative PCR (qPCR) analysis (fold change compared with HADF) 
CiPCs after reprogramming from HADF, showing increased expression of 
photoreceptor-specific genes. Data are presented as mean +s.e.m. ofn=3 
independently treated biological replicates. h, Micrograph of NRL-stained 
CiPCs after conversion from HADF. Scale bars 33 pmi, Principal component 
analysis for all RNA-seq samples (n=3 samples for each).j, Images of CiPCs after 
conversion from HADF at day 10. Revrn, recoverin; Rho, rhodopsin.Scale bars 
14.4 umk, Heat map of RNA-seq data for the indicated photoreceptor genes. 
RI, reprogramming intermediate. I, Heat map of RNA-seq data for the 
expression of the indicated genes that express retinal transcription factors 
during CiPC conversion. Experiments inb, h, iwere repeated three times with 
similar results and the experiments inc, d, were repeated twice with similar 
results. CPM, counts per million. 


Next, Nri-GFP* CiPCs were purified by fluorescence-activated cell 
sorting (FACS) and subject to transcriptomic analysis, which revealed 
the expression of early retinal neuronal markers (CHX10 and OTX2) 
and photoreceptor markers (CRX and NRL) (Fig. 1d, e, Supplemen- 
tary Table 3). Immunostaining analysis of CiPCs on day 11 enabled the 
visualization of CRX and CHX10 (Fig. 1c, Extended Data Fig. 1c, Sup- 
plementary Table 4). Molecular analysis of reprogrammed human 
CiPCs revealed the expression of photoreceptor-specific markers such 
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as CRX, rhodopsin and recoverin (Fig. 1g—i, Extended Data Fig. Ic, d, 
Supplementary Fig. 2). 


CiPCs express photoreceptor genes 


Transcriptome profiling demonstrated that the transcriptome of 
Nrl-GFP* CiPCs resembles that of native rods, which were used as 
a positive control’. Heat map analysis revealed the expression of 
rod-specific genes in CiPCs (Fig. 1i-I, Extended Data Fig. 9a, b), although 
progenitor-specific genes of other retinal cells—such as cone cells, 
ganglion cells and Miller glial cells—were expressed at very low levels 
(Extended Data Fig. 2b). Moreover, fibroblast-specific genes were found 
to be downregulated in CiPCs and in reprogramming intermediates 
(Extended Data Fig. 9c). Expression of retinal transcription factors— 
such as Otx2, Nrl and Crx—was also evident in CiPCs (Fig. 1k). Notably, 
photoreceptor-specific transcription factors—such as Rorb, Ascl1, 
Pias3, Thrb and Rxrg—were upregulated in reprogramming intermedi- 
ates”**5 (Fig. 11). These results indicate that CiPCs havea similar gene 
expression profile to that of their native counterparts. 

Because MEFs are composed of a heterogeneous cell population, we 
performed lineage tracing with Fsp1-Cre;Ai9 MEFs (after FACS puri- 
fication) and confirmed that CiPCs can originate from fibroblasts 
(Extended Data Fig. le—-h). Staining with 5-bromodeoxyuridine (BrdU) 
during reprogramming to CiPCs indicated that approximately 95% of 
Nrl-GFP* cells were negative for BrdU, suggesting that no intermediate 
proliferative stage exists (Extended Data Fig. 2a). 


CiPC transplantation in rd1 mice 


To test whether CiPCs can activate existing retinal circuitry and restore 
visual function, FACS-purified Wri-GFP* CiPCs were transplanted into 
the subretinal space of 14 rd1 eyes, amouse model of retinal degenera- 
tion (Extended Data Fig. 2c). Recently, pupillary light reflex has been 
reported as arobust method to measure photoreceptor function after 
cell transplantation”®. Pupillary constriction under low light levels is crit- 
ically dependent upon the function of rod photoreceptors. Six out of 14 
(43%) rd1 eyes demonstrated improved pupil response under low-light 
conditions three and four weeks after transplantation (Fig. 2a, b). 
None of the mice demonstrated pupil constriction at two weeks, which 
served as a baseline (internal negative control) for longitudinal com- 
parison and reduced the likelihood that a pre-existing photoreceptor 
or alternative pupillary pathway—such as the melanopsin pathway—is 
responsible for the observed pupillary restoration (Fig. 2b). To assess 
the restoration of visual function, the six pupil-responsive mice under- 
went light-aversion testing. The light-aversion test provides an option 
for mice to spend time in either a dark space or alit space. Mice have an 
innate tendency to avoid lit spaces; as such, those with visual function 
favour dark spaces”. To test rod vision in these pupil-reflex-positive 
mice, we allowed the mice to adapt to dark conditions and then per- 
formed the test using illumination conditions of 50 Ix. rd1 mice injected 
with CiPCs were found to spend significantly more time in the dark 
space compared with those injected with PBS control (Fig. 2c). None 
of the CiPC-injected mice without improved pupil responses demon- 
strated a dark preference. We also performed a modified optomotor 
test with neutral density filters; this modification was necessary in 
order to prevent rod saturation and to test rod-mediated vision. An 
additional benefit of testing at low light intensities is that the possibil- 
ity of activating any residual cones is substantially reduced. One of the 
six CiPC-injected mice that demonstrated improved pupil responses 
and dark preference also showed improvements in visual acuity and 
contrast sensitivity (Extended Data Fig. 3e, f). These results providea 
proof-of-concept that CiPCs can restore visual function in rd1 mice. 
We next sought to rule out the possibility that CiPCs primarily func- 
tion to reduce the degeneration of host photoreceptors. We trans- 
planted Nri-GFP* CiPCs into the subretinal space of rd1 mice on day 
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Fig. 2 | Functional analysis of CiPCsin a mouse model of retinal 
degeneration. a, Representative images from the pupil analysis. CiPC-injected 
eyes 3 months after transplantation. b, Illuminance response curves for 
pupillary constriction (n=6). The stimulus was a 20-s light exposure and the 
data are fitted with a sigmoidal function. Arrows show the recovery of pupil 
function. The results presented are for pupil-responsive mice only. Statistical 
significance was assessed using a two-tailed Student’s ¢-test. WT, wild type. 

c, The time spent by mice in the dark area of the light-aversion test. Wild-type 
C57BL/6 mice were used for comparison. Statistical significance was assessed 
using a one-way ANOVA. Dataare presented as mean+s.e.m.d, Top left, 
integration and survival of GFP* CiPCs in the retina of anrd1 mouse 3 months 
after transplantation. Scale bar, 20 um. Top right, a magnified view of the top 
left image. Scale bar, 10 pm. Bottom, additional images of the integration of 
GFP* CiPCs. Bottom left, 40x magnification; bottom right, z-section. Scale 
bars: left, 10 pm; right, 20 pm. GCL, ganglion cell layer; INL, inner nuclear layer. 
Experiments inaand d were repeated twice with similar results and denotes 
the number of biologically independent mice. 


31—a later time point at which there is near complete photoreceptor 
degeneration and electroretinogram (ERG) signals are extinguished”®. 
Long-term retinal function was analysed by pupillary light reflex and 
ERG (Extended Data Fig. 2c). Approximately three months after trans- 
plantation (at postnatal day (P)128), we recorded a30%-40% increase 
in pupillary constriction in response to a low light stimulus (50 Ix) in 
three of the six eyes into which CiPCs had been transplanted (Extended 
Data Figs. 2c and 3d). Full-field ERG analysis demonstrated an improve- 
ment of the scotopic a-wave in three out of six eyes into which CiPCs 
had been transplanted at P45; this improvement was lost at and after 
P59 (Extended Data Fig. 3a-c). One eye showed improvement in the 
scotopic b-wave at P45 (Extended Data Fig. 2d). 

The long-term survival of transplanted CiPCs and their synaptic 
connections to inner retinal neurons were assessed by immunofluo- 
rescence. Three months after CiPC transplantation, we identified GFP", 
GFP*recoverin*® or GFP*rhodopsin* CiPCs inthe outer aspect of the inner 
nuclear layer in eyes that demonstrated restoration of the pupillary 
light reflex (Fig. 2d, Extended Data Figs. 4a, b, 8d). We quantified CiPC 
survival and found a strong correlation between pupil constriction and 


cell survival (Extended Data Fig. 8e). Specifically, eyes with improved 
pupil response and light/dark response had an average of 58 CiPCs 
per section, compared with 8 CiPCs per section in non-responders 
(Extended Data Fig. 8f). Moreover, CiPCs were found to have synaptic 
terminals that express the rod ribbon-synapse protein ribeye and the 
synaptic vesicle protein synaptophysin. These synaptic terminals arein 
close proximity to rod bipolar cells (expressing bipolar marker PKC-a), 
which is essential for transmitting a light signal into the inner retina 
(Extended Data Fig. 4c-e). Taken together, these results suggest that 
some of the transplanted CiPCs survive, function and connect with the 
inner retinal neurons of rd mice. 


NF-KB induces ASCL1 during conversion to CiPCs 


To explore the mechanism of reprogramming to CiPCs, we identified 
candidate transcription factors from RNA sequencing (RNA-seq) analy- 
sis. We detected the expression of Asc/1 on day 5, which continued to 
increase until day 8 (Fig. 1I, Fig. 3a). ASCL1, a proneural transcription 
factor, is capable of converting fibroblasts into neurons”. Further- 
more, ASCL1is reported to be transiently expressed in photoreceptor 
precursors and can reprogram Miller glia into rod photoreceptor-like 
cells5°, Taken together, we proposed that the Asc/J transcriptional 
network has a central role in reprogramming fibroblasts to CiPCs. To 
test this hypothesis, we reprogrammed Ascl1-depleted (Fig. 3b) Nrl-GFP 
MEFs with the set of five compounds (VCRFI). We noted a 70%-80% 
reduction inthe generation of CiPCs, which suggests that ASCL1 has an 
important role inthe reprogramming of fibroblasts to CiPCs (Fig. 3c). 

To determine the mechanism of ASCL1 induction, we investigated 
potential upstream regulators of ASCL1. NF-kB is a rapidly acting pri- 
mary transcription factor that is known to have arole in neural stem-cell 
differentiation and embryonic neurogenesis”. We therefore postulated 
that NF-kB is an upstream regulator of ASCL1 induction. A luciferase 
assay demonstrated that NF-kB activation began on day5 and reacheda 
maximum at day 11 (Fig. 3d). These results suggested the involvement of 
NF-KB, and so we next explored whether NF-kB induces the expression 
of ASCL1. Bioinformatics analysis (using rVista) identified a putative 
binding site for NF-KB near the AsclJ loci (Fig. 3e), and a chromatin 
immunoprecipitation (ChIP) assay confirmed the binding of NF-KB at 
this locus (Fig. 3f). Furthermore, transient transfection analysis witha 
luciferase reporter gene confirmed that NF-KB positively regulates Ascl1 
during reprogramming to CiPCs. NF-KB depletion in MEFs that were 
subsequently reprogrammed to CiPCs resulted in reduced expression 
of Ascl1 and fewer CiPCs overall, which further confirms the pathway 
(Extended Data Fig. 5a—c). Overexpression of AsclJ alone in MEFs was 
not sufficient for reprogramming to CiPCs (Extended Data Fig. 5f-h). 
These results indicate that treatment with our set of five compounds 
leads to the activation of NF-KB, which in turn binds to the regulatory 
regions of Asc/1 and controls its expression. Notably, analysis of CiPCs 
using an assay for transposase-accessible chromatin with sequencing 
(ATAC-seq) revealed the presence of open chromatins at the upstream 
regions of the Asc/1 gene in reprogramming intermediates. Homer 
analysis revealed the enrichment of Ascli and NF-KB-binding motifs in 
bothintermediates and reprogrammed cells, which may be important 
for the binding of regulatory transcription factors and for their expres- 
sion (Extended Data Fig. 9f, Supplementary Table 7). 


Reactive oxygen species activate NF-KB 

To investigate the mechanism by which NF-kB activation is induced by 
the combination of five small molecules, we considered known induc- 
ers of NF-kB—suchas TNFa, lipopolysaccharide, ionizing radiation and 
mitochondrial reactive oxygen species (mROS)—as possible candi- 
dates”’, mROS have been reported to induce nuclear gene expression 
through the activation of NF-KB”. We therefore postulated that mROS 
generated by treatment with the five compounds may activate NF-KB. 
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Fig. 3 | The mROS-NF-KB-ASCLI signalling axis determines the 
reprogramming of fibroblasts to CiPCs. a, Asc/J transcript expression during 
reprogramming of MEFs to CiPCs, analysed by qPCRusing the2“*“" 

method. RI, reprogramming intermediates, R, reprogrammed cells. 

b, Ascl1 qPCR analysis in Ascl1 depleted MEFs, using short hairpin RNA (shRNA). 
c, Number of GFP* cells after reprogramming to CiPCs from Ascl1-knockdown 
(KD) and wild-type Nri-GFP MEFs. d, NF-KB-luciferase activity during 
reprogramming of MEFs to CiPCs. LPS, lipopolysaccharide. e, rVistasequence 
alignment of human ASCL1 and mouse Ascl/1 genes show highly conserved 
NF-kB-binding sites (red vertical bar) downstream of the 3’UTR region. f, ChIP 
assay shows binding of NF-KB at downstream loci of Ascl1. g, Accumulation of 
mROS in CiPCs converted from Nri-GFP MEFs on day 11. Scale bar 33 pmh, 
Fluorometric analysis of mROS generation during reprogramming of MEFs to 
CiPCs. AntiA, antimycin A; RFU, relative fluorescence units. i, Quantification of 
Nrl-GFP* cells on day 11 after depletion (by MitoTEMPO treatment) or generation 
(by Tfam knockdown) of mROS.SM, small molecules. Ina-d, f,h,i, dataare 
presented as mean +s.e.m. of n=3 independently treated biological replicates. 
Statistical significance was assessed by two-tailed Student’s t-test. 


We observed an evident increase in mROS accumulation, beginning 
on day 8 inreprogramming intermediates (Fig. 3g, h). To determine 
the importance of mROS, we used the antioxidant MitoTEMPO, an 
mROS scavenger, and observed a reduction in reprogramming to CiPCs 
(Fig. 3i). Depletion of mitochondrial transcription factor A (TFAM) has 
been reported to lead to the generation of mROS**. Our data show that 
withdrawal of the compound IWR1 reduces reprogramming efficiency 
and mROS generation (Fig. 3i, Extended Data Fig. 6f). We considered the 
possibility that TFAM-depleted MEFs may have increased reprogram- 
ming potential in the absence of IWR1. Consistent with this hypothesis, 
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we found increased reprogramming to CiPCs and increased mROS 
generation in TFAM-depleted MEFs in the absence of IWR1 (Fig. 3i, 
Extended Data Fig. 6e, f). Furthermore, exogenous ROS or TNFa 
(a NF-KB inducer) did not havea significant effect on CiPC conversion 
efficiency, and excessive ROS had a negative effect (Extended Data 
Fig. 5d, e). These data indicate that mROS has an important role inthe 
reprogramming of fibroblasts to CiPCs. 

To determine whether the activation of NF-KBis dependent on mROS, 
we added MitoTEMPO on day 3 of reprogramming and observed signifi- 
cantly decreased activity in the luciferase assay (Extended Data Fig. 6a). 
Additionally, we found that the extent of binding of NF-kB near the Ascl1 
loci was reduced upon treatment with MitoTEMPO (Extended Data 
Fig. 6g). Taken together, these results demonstrate that mitochondrial 
ROS activates NF-KB, which controls the expression of Ascl1 by binding 
its regulatory region. 


The five compounds promote Axin2 mitolocalization 


To identify the mechanism by which mROS is generated during repro- 
gramming to CiPCs, we considered two reports that demonstrate the 
stabilization of AXIN2 by treatment with CHIR and IWR1* and the 
translocation of AXIN2 to mitochondria upon treatment with XAV939 
(an IWR1 analogue)**. We proposed that treatment with the group of 
five compounds—which includes IWR1 and CHIR—induces the stabi- 
lization of AXIN2 and its subsequent translocation to mitochondria; 
mitochondria-targeted AXIN2 then generates mROS, whichin turn acti- 
vates NF-KB. To test the hypothesis, the expression of AXIN2 was exam- 
ined during the reprogramming to CiPCs. We discovered that AXIN2in 
reprogramming intermediates and in day 11non-purified CiPCs is more 
stabilized compared with that in starting MEFs (Fig. 4a). Subsequently, 
we found that stabilized AXIN2 translocates to the mitochondria of 
CiPCs, as evidenced by its co-localization with mitochondria (Fig. 4b, c, 
Extended Data Fig. 7a, b). Axin2-depleted MEFs demonstrate reduced 
reprogramming to CiPCs and reduced mROS generation (Fig. 4d-f, 
Extended Data Fig. 6d). For further confirmation we measured NF-KB 
activation and Ascl1 expression in Axin2-depleted reprogramming 
intermediates. We detected reduced Ascl1 expression and decreased 
NF-KB activity (Fig. 4g, Extended Data Fig. 6b). Taken together, these 
results indicate that mitochondria-targeted AXIN2 induces increased 
mROS generation in CiPCs and activates the downstream NF-KB-ASCL1 
pathway to promote a lineage switch to a photoreceptor fate. Moreo- 
ver, mitochondrial analysis of CiPCs revealed low basal mitochondrial 
respiration, low ATP turnover, low reserve capacity and high glycolysis 
rates; this is indicative of animmature mitochondrial state, which may 
be linked to the generation of mROS (Extended Data Fig. 7c-f). 


Discussion 


Here we report that acombination of five small molecules can convert 
fibroblasts into functional CiPCs that are capable of partially restoring 
pupil reflex and visual function in amouse model of retinal degenera- 
tion (Extended Data Fig. 10a). Gene expression profiling reveals that 
CiPCs are similar to their in vivo rod counterpart, and CiPC conversion 
recapitulates the ontogeny of photoreceptor genesis—as indicated 
by the upregulation of Thrb, Rorb and Pias3, among others”. The 
occurrence of open chromatin regions near photoreceptor loci and the 
enrichment of photoreceptor-specifying transcription-factor-binding 
motifs during reprogramming to CiPCs further validate these results*”** 
(Extended Data Fig. 9f, Supplementary Table 7). Subretinal transplanta- 
tion of rod-like CiPCs into rd1 mice led to long-term improvement in 
pupillary light reflex and the restoration of normal visual behaviour in 
the light aversion test. In our study, 6 out of 14 mice (43%) demonstrated 
animproved pupillary response. The strong correlation between pupil 
constriction and cell survival may explain why some eyes showed an 
improvement and others did not (Extended Data Fig. 8e). 
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Fig. 4| Mitochondria-translocated AXIN2 causes mROS generation and the 
reprogramming of fibroblasts to CiPCs. a, Western blot demonstrating 
AXIN2 expression during reprogramming of MEFs to CiPCs. For gel source 
data, see Supplementary Fig. 1c, d.b, Noco-localization of AXIN2 (green) 

with mitochondria (MitoTracker) was observed in MEFs. Scale bar, 13.4 um. 

c, AXIN2 (red) localization in mitochondria (green) of GFP* CiPCs (purple 
pseudocolour). Scale bar, 2 um. d, Western blot demonstrating reduced levels 
of AXIN2 after knockdown of its encoding gene, Axin2, by shRNA. For gel 
source data, see Supplementary Fig. 1g, h. In a-c, experiments were repeated 
twice; ind, the experiment was performed once. e, Quantification of Nri-GFP* 
cells after conversion of Axin2-depleted MEFs to CiPCs on day 11. f, Reduced 
mROS generation in Axin2-depleted cells during reprogramming to CiPCs, as 
assessed by fluorimetry. AFU, arbitrary fluorescence units. g, qPCR analysis 
shows that knockdown of Axin2is associated with reduced expression of Ascl1 
in reprogramming intermediates on day 8 and CiPCs on day 11. Scr., scramble. 
In e-g, data are presented as meant+s.e.m. ofn=3 independently treated 
biological replicates. Statistical significance was assessed by two-tailed 
Student’s test. 


Material transfer and cell fusion are mechanisms that can explain 
improvement in visual function; however, these mechanisms have not 
been reported in the late stages of retinal degeneration, during which 
viable host photoreceptors are required in order for improvement to 
occur”**’, These mechanisms are of most concern in photoreceptor 
loss-of-function models—such as Gnati—in which the photoreceptors 
have limited or no degeneration and retinal morphology and structure 
are preserved. To avoid the possibility of donor-host cell fusion and 
material transfer, we chose to transplant CiPCs into rdI mice at atime 
point (postnatal day (P)31) at which no rod photoreceptors exist”**°. 
Although rod loss is almost complete at P31, we examined the possibility 
that existing cones or bipolar cells could participate in CiPC material 


transfer or cell fusion; however, we found that these are unlikely to 
have a major role (Extended Data Figs. 4d, 8c). 

Our mechanistic studies reveal that mROS-mediated NF-KB activation 
directly regulates the expression of Asc/1 and the reprogramming of 
fibroblasts into CiPCs. Mitochondrial signalling is reported to have arole 
in cellular homeostasis and neuronal function*’ ’. To our knowledge, 
this reportis the first to demonstrate that mitochondria-to-nucleus sig- 
nalling acts as a mediator for direct chemical reprogramming (Extended 
Data Fig. 10b). Moreover, mitochondria-translocated AXIN2 prob- 
ably generates mROS, although the mechanism is currently unknown. 
Induction of the reprogramming of fibroblasts to CiPCs by our group 
of five compounds reveals a new function of the mitochondria that 
may provide valuable knowledge for the generation of other cell types. 

Although CiPCs have therapeutic potential, a lack of proliferation—as 
isthe case for native photoreceptors—and low conversion efficiency are 
the main impediments for atranslational application. We anticipate that 
optimization of our current protocol may be beneficial for obtaining 
large numbers of CiPCs. For example, temporal modulation of IWR1 
in the protocol resulted ina significant increase in the conversion of 
HADF to CiPCs (Fig. 1f, Extended Data Fig. 8d, right). Overall, CiPCs 
area promising cell-replacement candidate and may lead toascalable 
therapy for vision restoration. 
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Methods 


Generation of chemically induced photoreceptor-like cells 

All small molecules were diluted in DMSO or DMEM according to 
the manufacturer’s instructions. Approximately 50,000-80,000 
MEFs (passage 2) were seeded into each well of a 12-well plate (0.1% 
gelatin-coated overnight) and cultured overnight. On day 1, the medium 
was replaced with photoreceptor induction medium (PIM; Supplemen- 
tary Tables 1,2) containing V (0.5 mM), C (4.8 1M), R(2 uM) and F (10 pM). 
On day 3, fresh PIM containing V, C, R, F plus I (10 uM) were added 
into each well. The medium was then replaced with PIM containing 
VCRFI on days 4-7 depending on cell appearance. S (3 nM), T (100 pM) 
and R (1p1M) were added on day 8 with fresh medium containing all 
the small molecules mentioned above. On day 10-11, GFP* cells were 
collected and analysed for gene expression. For CiPC maturation, PIM 
was replaced with photoreceptor differentiation medium with the small 
molecules VCRF and cultured up to day 15-16 (Supplementary Tables 1, 
2). CiPC survival decreases considerably after day 11 with only a few 
mature cells obtained at day 15-16. For human adult dermal fibroblast 
(HADF) reprogramming, cells were seeded (to 95 to 100% confluence) in 
IMR90 medium. On day 1, PIM containing VCRFI (at the same concentra- 
tions as stated above) was added to the wells. PIM containing VCRFISTR 
was added on day 5. NRL staining was performed between day 7-10. We 
noted more autofluorescent cells after fixation and these cells were not 
considered as reprogrammed. The medium and all small molecules were 
replenished daily throughout the conversion period depending upon 
appearance. Human Fetal Lung Fibroblasts (HFL1) conversion to CiPCs 
was similar to mouse protocol, except that 5 uMIWRI1 was added on day 
2.Thesmall molecules §S, T and R, as well as brain-derived neurotrophic 
factor, glial cell line-derived neurotrophic factor and neurotrophin-3 
were added on day 5. On day 6-8, DsRed* cells were collected for analysis. 


Mouse models and MEF isolation and lineage tracing 

All animal studies and animal care were performed in accordance with 
relevant guidelines and regulations and approved by the Institutional 
Animal Care and Use Committee at the University of North Texas Health 
Science Center. Nrl-GFP reporter mice were a gift from A. Swaroop 
(National Eye Institute, NEI) and were used to generate MEFs. Nrl-GFP 
cells were not identified in either starting fibroblast culture**. rtTa mice 
(Jackson Laboratory, 006965) were crossed with Nrl-GFP mice for 
unrelated experiments. Embryos from Fsp1-Cre mice (Jackson Labo- 
ratory, stock# 012641) crossed with R26-LSL-tdTomato mice (Jackson 
Laboratory, stock# 007909) were used for lineage tracing. Embryos 
were examined for tdTomato expression before isolation of MEFs. 
td-Tomato’ cells obtained from FACS were used for reprogramming 
to CiPCs according to the protocol described above, and CiPCs were 
stained with anti-NRL antibody according to the method detailed below. 
CBA/J (rd1) mice (stock# 000656) and C57BL/6 mice (stock# 000664) 
were purchased from the Jackson Laboratory. 


Subretinal injection of CiPCs into rd1 mice 

After conversion, GFP* CiPCs were sorted by FACS from multiple 
10-cm dishes and resuspended in IMR90 medium. For subretinal injec- 
tion, mice (P31 and P24) were first anaesthetized by intramuscular 
injection of 85 mg kg‘ ketamine and 14 mg kg" xylazine. A 30-gauge 
blunt-end needle attached toa 10 ml Nanofil syringe (World Precision 
Instruments) was inserted into the puncture site (0.5-1 mm under the 
limbus line) with visualization aided by the use of asurgical microscope 
(Leica). CiPCs (80,000; 1.5-2 pl per eye) or saline (1.5-2 pl per eye) were 
delivered into the subretinal space in the superior temporal quadrant. 
After injection, the needle was left in place for a few seconds to reduce 
reflux and enable maximal cell release before being slowly withdrawn. 
The eyelid was returned to its original position and a drop of Triple 
Antibiotic (Equate, Walmart) ointment was applied. Mice were warmed 
ona37 °C bed until fully awake. 


Pupillometry 

Similar to a previously published protocol*®, dark-adapted mice 
were used to capture images under infrared illumination. CiPC- 
transplanted eyes that showed ERG improvement were exposed to 
low-irradiance white light through a light guide from a100-W goose arc 
lamp at arange of intensities. For the illuminance curve experiments, 
dark-adapted mice were subjected to a series of light exposures (with 
increasing illuminance) for 10 s on weeks 2-4 after transplantation. A 
complete intensity series was performed in one eye before retesting the 
other eye at similar intensities. A gap of at least 2 min was maintained in 
between measurements. Pupil constriction was imaged in the contralat- 
eral, non-transplanted eyes. An infrared light-emitting diode was used 
throughout the experiment for background illumination. Aninfrared 
camera (Sony, DCR-HC96) was used to acquire images. The pupil area 
for each eye was measured before and after light exposure with ImageJ 
software (National Institutes of Health). For each mouse, the change 
in pupil constriction was represented by the difference between the 
pupil area measured in the dark and in the light. 


Quantitative PCR and PCR with reverse transcription 

Total RNA was extracted using a kit (Zymo Research, Cat# R1050) 
according to the manufacturer’s instructions. RNA (1 pg) was con- 
verted to cDNA using a High Capacity cDNA Reverse Transcription 
kit (Applied Biosystems, 4368814). Isolated RNAs were treated with 
DNAase-I before c-DNA synthesis. A thermal cycler from Applied Biosys- 
tems and OneStep Plus real-time PCR were used for amplification. qPCR 
was performed using Fast SYBR Green Master Mix (Applied Biosystems, 
4385612). Results were normalized with glyceraldehyde 3-phosphate 
dehydrogenase or hypoxanthine-guanine phosphoribosyltransferase 
and fold change was calculated using the 2“““' method. A list of primers 
is provided in Supplementary Table 3. 


Transcriptome analysis 
Quantification. RNA-seq data analysis was performed at the gene level 
as previously* with Ensembl data release 87. The gene level count matrix 
was then TMM-normalized (TMM, trimmed mean of M values) using the 
edgeR (v3.18.1) package in the R (v3.4.0) programming environment, 
as previously described**. 


Gene expression clustering and heat maps 

Gene expression clustering was performed on selected genes using 
the affinity propagation algorithm by using negative distance as the 
similarity for ordering the genes in each set before drawing the heat 
maps. The heat map function was developed in-house and is available 
upon request. 


ATAC-seq 

Cells were washed twice with fresh medium and 1:100 volume of 
100XDNase buffer (250 mM MgCl, and 50 mM CaCl inH,O) and1:100 
volume of 100X DNAase/ solution (Worthington, LS002006, resus- 
pended in HBSS at 20,000 units per ml) was added to the medium. 
Cells were incubated in DNase at 37 °C for 30 min ina tissue culture 
incubator. After washing the medium with PBS, cells were then 
trypsinized, centrifuged (800g for 5 min) and resuspended in 500 ul 
of growth medium containing 5% DMSO ina slow cooling chamber. 
Approximately 100,000 cells were then shipped to Active Motif to 
perform ATAC-seq and bioinformatics analysis. 


Immunohistochemistry, subcellular fractionation and 
immunoblotting 

Paraformaldehyde (4%) was used to fix enucleated eyes. Cryo-embedded 
eyes were then sectioned with a thickness of 14 um. Fixed eye sections 
were analysed with primary and secondary antibodies listed in Supple- 
mentary Table 4. DAPI (0.1%) was used to stain the nucleus in mounting 
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medium. Images were taken using a Zeiss LSM510 confocal/Leica DMi8 
florescence microscope. Subcellular fractionation was performed 
following a kit protocol from Thermo Fisher Scientific (Cat# 89874). 
For immunoblotting, total protein was extracted with commercially 
available lysis buffer (Thermo Scientific, Cat# 89900) and then con- 
centration of proteins was measured using a BCA protein assay kit 
(Thermo Scientific, Cat# 23227). An equivalent amount of proteins was 
loaded into each well, immunoblotted and antibody-stained using a 
standard procedure (Supplementary Table 4). SuperSignal West Femto 
maximum sensitivity substrate (Thermo Scientific, Cat# 34094) was 
used to develop the subcellular fractionation western blot. 


Electroretinogram 

After a minimum dark-adaptation period of 12 h, mice were anaesthe- 
tized by intraperitoneal injection of 85 mg kg ‘ketamineand14mgkg™ 
xylazine. Preparation was performed under a dim red light (<S0 lux). 
ERG analyses were performed using an Espion system (Diagnosys). For 
the assessment of scotopic response, a stimulus intensity of 40 cd-s m7 
was presented to the dark-adapted dilated eyes. The amplitude of the 
scotopic a-wave was then measured from the pre-stimulus baseline to 
the a-wave trough. The amplitude of the b-wave was then measured 
from the trough of the a-wave to the crest of the b-wave. A total of 15 
repeated flashes and measurements were averaged to produce the final 
waveform. The amplitude of the photopic b-wave was then measured 
from the trough of the a-wave to the crest of the b-wave. At the begin- 
ning of the day, the response of wild-type C57BL/6 mice (aged over P21, 
n>2) was recorded and quantified to ensure proper device calibration. 


Light-aversion test 

The visual discrimination (light/dark) test was conducted in an appa- 
ratus that consists of black opaque (100%) acrylic test chambers 
(30.48 x 15.24 x 30.48 cm (length, width, height)). This chamber was fur- 
ther divided into equal-sized compartments (15.24 x 15.24 x 30.48 cm) by 
the addition of an insert, to create a dividing wall inthe centre. Further, 
to create light and dark zones, one compartment was illuminated with 
dim ambient light (SO + 1.5 1x) and the other compartment was kept dark 
(~ 0.11x). The light and dark compartments were connected by an 
opening (5 x 5cm). The position of the mouse within the apparatus was 
recorded using a photocell-based system (Model 71-CPPX, Omnitech). 
The acrylic chambers were housed separately in sound-attenuating 
chambers (Model 71-ECC, Omnitech). Ambient noise within the cham- 
bers was 64 dB and testing took place under dim illumination. 

Mice were maintained in the testing room overnight (about 12 h) in 
dark conditions in their home cage with free access to food and water. 
Each mouse was allowed to habituate to the testing apparatus (both 
sides) for 10 min while in the dark. After habituation, one side of the 
apparatus was illuminated with an ambient light at around 50 Ix and 
the mouse was allowed to roam freely between each compartment for 
5 min. The time spent in the dark and light compartments was recorded 
by a photocell-based system. 


Optomotor task 

The testing apparatus was a chamber (39 x 39 x 32.5 cm) with mirrored 
floors and ceilings. Attached to each of the four walls was a 20-inch 
computer monitor facing inwards. In the centre of the chamber was a 
platform (7-cm diameter) that was elevated approximately 15 cm from 
the floor. When a mouse was placed on the platform, a video camera 
positioned in the ceiling of the apparatus enabled the behaviour of 
the mouse to be clearly visible during testing. Acomputer program 
was used to project visual stimuli (vertical gratings) onto the monitors 
(OptoMotry, CerebralMechanics). The gratings were rotated at 12°s7, 
producing the appearance of a virtual rotating cylinder. The moving 
gratings elicited a tracking behaviour and the visual acuity thresh- 
old and contrast sensitivity was determined for each eye rotating ina 
clockwise direction (when testing the left eye) or in the anticlockwise 


direction (when testing the right eye). Light levels were lowered to 50 Ix 
and 65 Ix by using neutral density filters placed on the screens. Before 
any testing took place, the mice were dark-adapted in their home cages 
with free access to food and water. 


Visual acuity. A mouse was placed ontop of the platform and allowed 
to acclimatize for a short period of time. Testing began when the mouse 
was no longer actively moving around. The visual acuity threshold 
was determined with contrast set at 100% and a grating of low spatial 
frequency (0.042 cycles per degree) as previously described in a re- 
cent report. When tracking behaviour was observed, the same stimuli 
were rotated in the anticlockwise direction (thus effectively testing 
the right eye). A staircase method of determining acuity threshold 
was implemented. A series of gratings of increasingly higher spatial 
frequencies were presented (rotating in one, then in the alternative 
direction) as long as the mouse indicated that it could detect the grat- 
ing movements. When the mouse ceased to respond to a particular 
spatial frequency, alower frequency grating was presented; when the 
mouse responded to a frequency, the frequency was increased. The 
acuity threshold was set as the highest spatial frequency to which the 
animal responded. 


Contrast-sensitivity function. A contrast threshold was measured for 
six spatial frequencies (0.031, 0.064, 0.092, 0.103, 0.192, 0.272 cycles 
per degree). The initial contrast was set at 100% for each of the above 
spatial frequencies. Contrast was lowered until the mouse ceased to 
respond to the particular grating. The lowest contrast that elicited 
a response was assessed for each of the six spatial frequencies, and a 
contrast-sensitivity function was calculated with the formula 100/C, 
where Cis the lowest contrast that elicits a response at a particular 
frequency. This data transform means that when a mouse can see at a 
very low contrast setting, the sensitivity number will be large and will 
indicate better visual performance at a particular spatial frequency. 


Immunofluorescence and laser scanning confocal microscopy 
For confocal microscopy, converted and sorted cells were seeded on 
a chambered coverglass coated with 0.2% gelatin (Nunc). For mROS 
detection, cells were stained with MitoTracker Red (500 nM; Molecular 
Probes, M7512) for 30 min, washed, and used for downsream applica- 
tions. For antibody staining, cells were fixed with 4% paraformaldehyde 
for 15 min and permeabilized with 0.25% Triton. Cells were then stained 
with primary antibody overnight at 4° C. After incubation with the 
appropriate secondary antibody, images were captured using a Zeiss 
LSM 510 confocal microscope. Data analysis and 3D reconstruction 
were performed with the assistance of ZEN lite software. Alexa Fluor 
633-,549- and 488-tagged secondary antibodies were used. For count- 
ing Nri-GFP* CiPCs, ten 20x visual fields were selected randomly. The 
list of primary antibodies is provided in Supplementary Table 4. 


Measurement of mitochondrial ROS and TNF, H,O, treatment 
during reprogramming 

Mitochondrial ROS was detected and quantified using a published 
protocol”. In brief, CiPCs, Axin2-depleted MEFs or reprogramming 
intermediates were incubated with MitoSOX Red (500 nM; Molecu- 
lar Probes, M36008) for 30 min. Cells were then washed twice with 
1x PBS and fluorescence was monitored with a microplate reader set 
to an excitation wavelength of 510 nm (excitation bandwidth, 10 nm) 
and an emission wavelength of 595 nm (emission bandwidth, 35 nm). 
mROS generation was also imaged with a fluorescence microscope 
and quantified with Leica Application Suite X Software. The region 
of interest was outlined for each cell on the image after background 
subtraction. The average intensity within each region of interest was 
measured and exported to an Excel spreadsheet. The average change 
in fluorescence was calculated for each type of cell. There were at least 
three replicates for each condition. 


For H,O, production during reprogramming, D-galactose (0.5 mM) 
and galactose oxidase (0.015, 0.05, 0.1,1U ml) were added on day 4 
of reprogramming along with all the small molecules**. On day 10-11, 
the number of CiPCS was quantified under the microscope (20 fields 
were randomly selected for counting). For TNFa treatment, cells 
were treated with three different concentrations of TNFa (20 ng mI", 
50ng mI‘, 100 ng mI”) on day 8 and the number of cells was counted 
on day 11. 


Measurement of oxygen consumption rate and extracellular 
acidification rate by Seahorse assay 

About 25,000 CiPCs in 100 pl medium with all small molecules were 
seeded in XF24 cell culture plates. Blank medium was added to the 
appropriate wells for background correction. Plates were then placed 
inanincubator at 37 °C with 5% CO, for 6h. This step enables the cells to 
adhere onthe surface of the plate. Next, 150 pl growth medium contain- 
ing all small molecules was added to each well followed by incubation 
overnight at 37 °C and 5% CO.,. The following morning, the experiment 
was performed in a Seahorse Extracellular Flux Analyzer according 
to the manufacturer’s instruction at the Metabolic Phenotyping Core 
facility, University of Texas South Western Medical Center. 


FACS 

For FACS, CiPCs were trypsinized (0.25%) and passed through a40-um 
Nyloncell strainer (Fisher Scientific, Cat# 08-777-1) and suspended 
in PBS containing 3% bovine serum. Initiating MEFs were used as 
a negative control. Cells were then sorted in a Beckton-Dickinson 
LSRII Flow cytometer at the Flow Cytometry core facility, University 
of Texas Southwestern Medical Center and a Sony SH800 cell sorter 
at Flow Cytometry core facility, University of North Texas Health 
Science Center. Sorted cells were collected in IMR90 medium, cen- 
trifuged, and processed for RNA extraction and other downstream 
applications. 


RNAi and generation of shRNA transduced MEFs 

Lentiviral doxycycline-inducible shRNA constructs for Axin2 (GE 
Dharmacon, 12006), Tfam (GE Dharmacon, 21780) and RelA (Sigma, 
TRCNO0023583) were purchased for lentivirus preparation. Lenti- 
viral supernatants were collected for 4 days and concentrated using 
lenti-X concentrator (Clonetech, Cat# 631231). An aliquot of concen- 
trated lentivirus was then used to transduce PO Nri-GFP MEFs. For 
Ascl1 knockdown experiments, control (scramble) shRNA and lenti- 
viral shRNA (a gift fromJ. Johnson, UT-Southwestern Medical Center; 
cloned into PLKO.1 (Addgene)) constructs were transduced in MEFs. 
All lentiviral-transduced cells were selected for 3d in the presence of 
puromycin (1 pg mI"). Drug-selected cells were then used for chemi- 
cal conversion and shRNA induction (7fam and Axin2) was performed 
between day 4 and day 8 with doxycycline. Finally, CiPCs were quanti- 
fied on day 11. 


Western blotting 

Whole-cell lysates were prepared as follows: cells were washed twice 
with ice-cold PBS, then lysed in RIPA buffer (Thermo Fisher Scientific) 
with 1X protease and phosphatase inhibitor cocktail (Thermo Fisher 
Scientific) onice for 15-20 min. The mitochondrial and cytosolic frac- 
tions were prepared using acommercial kit (Thermo Fisher Scientific, 
Cat# 89874). Protein concentration was measured using a Pierce BCA 
protein assay kit (Thermo Fisher Scientific), and optical density at 562 
nm was measured using a Biotek Synergy 2 Microplate Reader (BioTek 
Instruments). The NuPAGE protein gel system was used to separate cell 
lysates by electrophoresis. Blots were incubated in appropriate primary 
antibodies overnight and incubated with western HRP substrate (Mil- 
lipore) or SuperSignal West Femto Maximum Sensitivity Substrate 
(34094). Images were acquired using the Innotech FluorChem HD2 
imaging system (Alpha Innotech). 


ChIP and amplification 

Real-time PCR-based quantitative ChIP analysis was performed using 
a ChIP Assay Kit according to the manufacturer’s instructions (EMD 
Millipore, Cat# 17-295). In brief, 20,000 cells were cross-linked with 
formaldehyde (to a final concentration of 1%) for 10 min at room 
temperature with gentle agitation. Cell sonication was performed 
such that the size of the chromatin would be between 300-500 bp. 
After pre-clearance with protein A agarose, chromatins were used for 
immunoprecipitation with specific antibodies targeted to NF-KB-p65. 
Immunoprecipitated chromatins were amplified with a Whole-Genome 
Amplification Kit (Sigma, WGA2). Amplified products were identified 
in agarose gel. Primers were designed to amplify 60-100 bp amplicons 
and were based on sequences from the Ensembl Genome Browser for 
mouse. Products were amplified with Fast SYBR Green Master Mix in 
a 20-l reaction. The amount of product was determined relative toa 
standard curve of input chromatin. Dissociation curves showed a single 
product for the amplicons. Primers for ChIP analysis are detailed in 
Supplementary Table 3. 


BrdU labelling and staining 

BrdU (1 uM) was added on day 3 of reprogramming, and conversion 
was continued until day 11. CiPCs were washed twice with 1x PBS and 
immunofluorescence was performed according to the published pro- 
cedure (https://www.abcam.com/protocols/brdu-staining-protocol). 


Preparation of Nri-DsRed promoter reporter cells 

To prepare the Nri-DsRed promoter reporter, we digested out the 
promoter and reporter fragment from a commercially available 
vector, pNri-DsRed (Addgene, 13764), using restriction enzymes. 
These fragments were then cloned into a gateway entry vector pEN- 
TR2B (Thermo Fisher Scientific, A10463). Positive clones were then 
shuffled into a destination vector, pLentiX1 Zeo DEST (Addgene, 
17299). The final product was then used for lentivirus preparation. 
Human fetal lung fibroblasts (HFL1; ATCC, CCL153) were transduced 
with the lentivirus and selected with Zeocin (200 pg mI"; InvivoGen, 
ant-zn-1) for 8 days. 


Statistical analysis 

All data are presented as mean +s.e.m. Statistical significance was 
determined using Student’s t-test and one-way ANOVA using Graph- 
Pad Prism Software (GraphPad Software); P values are indicated in 
the figures. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Supporting RNA-seq and ATAC-seq data are deposited in the Gene 
Expression Omnibus under accession codes GSE138520 (RNA-seq) 
and GSE138521 (ATAC-seq) respectively. Source data for Figs. 1-4 and 
Extended Data Figs. 1-3, 5-8 are available within the manuscript files. 
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Extended Data Fig. 1| Preparation of Nri-DsRed lentiviral reporter 
construct and lineage tracing. a, DNA gel shows cloned Nrl and DsRed 
fragments digested out fromthe pENTR2B vector. This experiment was 
performed once. b, Expression of Nr/-DsRed in human CiPCs on day 8.c, The 
number of Nri-GFP* cells upon subtraction of the indicated small molecules 
from the mixture. d, The number of Nri-GFP* cells after conversion (on day 11) 
upontreatment with each small molecule alone. e, Scheme showing the 
generation of the mouse model for lineage tracing. f, Flow sorting of FSP1Cre- 
tdTomato* MEFs. Left, scatter plot from FACS; middle, MEFs before cell 
sorting; right, MEFs after cell sorting. Scale bars, 25 um. This experiment was 
performed once. g, Cells expressing NRL are FSP1-tdTomato-positive on day 
11, suggesting that they originate from fibroblasts. h, Percentage of cells that 
are positive for both NRL and tdTomato, compared with those that arejust 
positive for NRL. Inc, d, h data are presented as meants.e.m. ofn=3 
independently treated wells. Statistical significance was assessed using a 
two-tailed Student’s t-test. 
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Extended Data Fig. 2 | BrdU staining, transcriptome analysis and 
functional testing of CiPCs. a, Images of BrdU-stained CiPCs on day 11. 

Scale bars, 10 pm. b, Heat map analysis of RNA-seq data for the indicated 
cell-type-specific genes. c, Timeline for subretinal transplantation of CiPCs and 
functional analysis. d, Scotopic b-wave after transplantation of converted 
CiPCs.n denotes the number of biologically independent animals. 
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Extended Data Fig. 3 | Functional analysis CiPCsin retinal degeneration 
mice (rd1). a—c, Analysis for the scotopic a-wave of ERG after transplantation 
of CiPCs at day 45 (a), day 59 (b) and day 77 (c). d, Scatter plot of pupil 
constriction measurements for CiPC-injected rdJ mice taken three months 
after transplantation. e, f, Measurement of visual acuity (e) and contrast 
sensitivity (f) for CiPC-injected mice that demonstrated improved pupil 
response after transplantation and control mice. OR, optokinetic response. 
For the CiPC-injected rd1 experiment, repeated values are plotted (forn=1 
mouse). Ina-e,n denotes biologically independent animals and the 
experiment inf was performed twice. 
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Extended Data Fig. 4| Transplanted CiPCs express photoreceptor-specific 
markers and connected tothe inner retinathree months after 
transplantation. a, Expression of recoverin (Recov, red) in CiPCs (green). 

b, Expression of rhodopsin (Rho, red) in GFP-expressing CiPCs. c, Expression 
of rod ribbon-synapse protein ribeye (red) in transplanted CiPCs (green). 

The bottom panel shows a magnified version of the image (arrowhead). 

d, Close apposition of transplanted CiPCs (green) with PKC-positive (red) rod 
bipolar cells (white arrowhead). e, Expression of synaptic marker protein 
synaptophysin (red, arrowhead) in transplanted CiPCs (green) (co-localization, 
yellow). d,e, Left, images at 20x magnification; right, images at higher 

digital magnification. In a-e, experiments were repeated twice with similar 
results. Scale bars, 10 pm. 
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Extended Data Fig. 5| Effect of NF-KB knockdown, H,0,, TNFa and the 
overexpression of Ascl1 onthe reprogramming of fibroblasts to CiPCs. 

a, qPCR analysis following shRNA-mediated knockdown (kd) of RelA (which 
encodes NF-KB) in MEFs. b, qPCR analysis of AsclJ expression after the 
reprogramming of RelA-depleted MEFs on day 11. c, Number of Nri-GFP cells 
after the conversion of RelA-knockdown MEFs on day 11. d, Effect of different 
concentrations of H,O,, generated by galactose and galactose oxidase (GAO), 
onthereprogramming of fibroblasts to CiPCs. e, Effect of cytokine (TNF, 50ng mI’) 
on CiPC conversion. f, qPCR analysis showing the overexpression (OE) of Ascl1 
in MEFs. g, Phase contrast micrographs of MEFs overexpressing Ascl1 before 
and after reprogramming in inducing medium. No Nri-GFP* cells 

were evident up to day 15. h, Fluorometric measurement of mROS in 
Ascll-overexpressing cells on day 8 of reprogramming. Dataare presented as 
mean+s.e.m. of n=3 independently treated wells. Statistical significance was 
assessed using a two-tailed Student’s t-test. 
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Extended Data Fig. 6| NF-xB-luciferase activity and generation of mROS 
during the reprogramming of fibroblasts to CiPCs. a, A luciferase activity 
assay reveals decreased activation of NF-KB upon treatment with MitoTEMPO 
and increased NF-kB activity upon depletion of TFAM on day 11 of 
reprogramming to CiPCs. b, Luciferase activity shows decreased NF-KB 
activation in Axin2-knockdown MEFs on day 8 and day 11 of reprogramming. 
SM, small molecules. c, MEFs stained with MitoSOX showed low levels of mROS 
production (scale bar, 33 pm). d, Plot of mROS accumulation on day 8, as 
visualized using fluorescence microscopy after MitoSOxX staining, in wild-type 
(WT) Nri-GFP MEFs treated with all compounds (top), Axin2-knockdown 
Nrl-GFP MEFs treated with all compounds (middle) and control Nri-GFP MEFs 
(bottom). e, qPCR analysis following Tfam depletion in MEFs using shRNA. 

f, Fluorometric measurement of MROS (after MitoSOX staining) in 
MitoTEMPO-treated and TFAM-knockdown cells on day 11. g, ChIP assay shows 
reduced binding of activated NF-KB near the Asc/1 locus upontreatment 

with MitoSOX. Ina, b, e-g, data are presented as meants.e.m. ofn=3 
independently treated wells. Statistical significance was assessed using a 
two-tailed Student’s t-test. 
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Extended Data Fig. 7 | Mitochondrial localization of AXIN2 in converted 
cells anda mito-stress test for CiPCs. a, Western blot after subcellular 
fractionation of CiPCs showed that AXIN2 was localized in the mitochondria. 

C, cytoplasm; M, mitochondria. For gel source data, see Supplementary 

Fig. 1e, f.b, Micrograph of AXIN2 (red)- and GFP (green)-stained CiPCs after 
z-stack and 3D reconstruction, also showing that AXIN2 localizes inthe 
mitochondria. Ina, b experiments were performed once. c, CiPCs were found 
to have a reduced oxygen consumption rate (OCR) compared with MEFs and 
day-8 reprogramming intermediates, as measured using a Seahorse assay. 
Oligo, oligomycin; FCCP, carbonyl cyanide-4-phenylhydrazone. d, The 
indicated mitochondrial parameters of CiPCs, quantified in terms of OCR using 
a Seahorse assay, compared with those of MEFs and day-8 reprogramming 
intermediates. e, The extracellular acidification rate (ECAR) of CiPCs, MEFs 
and day-8 reprogramming intermediates was measured using a Seahorse assay. 
f, The indicated mitochondrial parameters of CiPCs, quantified in terms of 
ECAR using a Seahorse assay, compared with those of MEFs and day-8 
reprogramming intermediates. In c-f, data are presented as mean+s.e.m. of 
n=3 independently treated wells. 
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Extended Data Fig. 8 | Conversion of human fibroblasts into CiPCs and 

in vivo functional testing of mouse CiPCs. a, qPCR analysis shows expression 
of the indicated photoreceptor-specific genes in CiPCs. b, Fluorescence 
microscopy image demonstrates the expression of NRL in CiPCs. a, b, HADF 
from patients with retinitis pigmentosa. This experiment was performed three 
times with similar results. c, Confocal microscopy image of the retina of an rd1 
mouse, immunostained with GFP (red) and PNA (peanutagglutinin, green), 
three months after transplantation of CiPCs. d, Left, confocal microscopy 
image of a retinal section showing GFP-positive CiPCs three months after 
transplantation. This experiment was repeated twice with similar results. 
Right, conversion of human HFLIs (original protocol, see Methods) and a 
modified protocol using HADFs (Fig. 1f) to CiPCs. e, The correlation between 
the number of transplanted CiPCs and the extent of pupil constriction.n=6 
eyes classified as pupil responders and n= 8 eyes classified as pupil 
non-responders. f, CiPC survival in the eyes of rd1 mice classified as pupil 
responders and pupil non-responders. Then values denote the number of eyes. 
Ina, d,e, data are presented as mean+s.e.m. ofn=3 independently treated 
wells. 
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Extended Data Fig. 9 | Gene expression and chromatin analysis of CiPCs. 
a, Pairwise gene expression comparison of RNA-seq data demonstrates that 
neuronal markers are increased, glial markers are not activated and fibroblast 
genes are silenced. Numbers on both axes represent logarithmic base 
2-transformed counts per million values for each gene. b, Heat map analysis 
(from RNA-seq data) of photoreceptor genes along with other neuronal and 
fibroblast genes. c, Fibroblast-specific gene expression as determined by 
RNA-seq during reprogramming. The expression of Colla1, Col2a1, Thyl, Ctgf, 
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representative of three independently treated samples. 
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Synaptophysin, Rabbit, Abcam, 1:500 (IF), Ab14692; CtBP2 (ribeye), Rabbit, BD, 1:500(IF), 612044. 


Validation All of the antibodies used in this study were used according to supplied instructions by the manufacturer or referenced 
publication. 
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Laboratory animals For pupil reflex and light aversion experiments, CiPCs were injected in to P31 or P24 rd1 mice (Jackson Laboratory, 000656). 
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(Jackson Laboratory, Ai9) crossed embryos were used for a standard procedure for MEF preparation (Li et al., 2015). All studies 
and animal care were performed in accordance with relevant guidelines and regulations as approved by the animal care and use 
committee at University of North Texas Health Science Center and EyeCRO. 


Wild animals This study did not involve wild animals. 
Field-collected samples This study did not involve samples collected from the field. 
Ethics oversight All animal studies and animal care were performed in accordance with relevant guidelines and regulations and approved by the 


animal care and use committee at University of North Texas Health Science Center. 
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Methodology 
Sample preparation Chemically reprogrammed cell population was PBS-washed, trypsinized (0.25%) and passed through a 40uM nylon cell strainer 
(Fisher Scientific, 08-777-1) and suspended in PBS containing 3% bovine serum. 
Instrument Beckton-Dickinson LSRII Flow cytometer, Sony SH800 cell sorter 
Software BD FACSDIVA, Sony SH800, Flowjo 


Cell population abundance — Approx 0.2% CiPCs were obtained from mouse embryonic fibroblasts and 0.9% were obtained from human fetal lung fibroblasts. 


Gating strategy Cells were first identified by FSC/SSC so that all cells are visible on the plot. FSC-H and FSC-A were used to identify single cells. 
Positive cell population was sorted out based upon green/red fluorescence expression. 
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Small interfering RNAs (siRNAs) are essential for proper development and 
immunity in eukaryotes’. Plants produce siRNAs with lengths of 21, 22 or 24 


nucleotides. The 21- and 24-nucleotide species mediate cleavage of messenger 
RNAs and DNA methylation”, respectively, but the biological functions of the 
22-nucleotide siRNAs remain unknown. Here we report the identification and 
characterization of a group of endogenous 22-nucleotide siRNAs that are generated 
by the DICER-LIKE 2 (DCL2) protein in plants. When cytoplasmic RNA decay and 
DCL4 are deficient, the resulting massive accumulation of 22-nucleotide siRNAS 
causes pleiotropic growth disorders, including severe dwarfism, meristem defects 
and pigmentation. Notably, two genes that encode nitrate reductases—N/A1 and 
NIA2—produce nearly half of the 22-nucleotide siRNAs. Production of 22-nucleotide 
siRNAs triggers the amplification of gene silencing and induces translational 
repression both gene specifically and globally. Moreover, these 22-nucleotide 
siRNAs preferentially accumulate upon environmental stress, especially those 
siRNAs derived from N/A1/2, which act to restrain translation, inhibit plant growth 
and enhance stress responses. Thus, our research uncovers the unique properties 
of 22-nucleotide siRNAs, and reveals their importance in plant adaptation to 
environmental stresses. 


Small RNAs (sRNAs) are produced by different proteins, have different 
lengths and exert their effects on the cell through different mecha- 
nisms. In general, cellular double-stranded RNA (dsRNA) precursors 
that are produced by RNA-dependent RNA polymerases and hairpin 
microRNA (miRNA) precursors are cleaved into 20-24-nucleotide 
(nt) sRNA duplexes by proteins of the Dicer family’?. One strand of 
the sRNA duplex is loaded into an Argonaute (AGO) protein to form 
an RNA-induced silencing complex (RISC)*. The Arabidopsis genome 
contains four Dicer-like (DCL) proteins, DCL1 to DCL4, with a variety 
of roles. DCL1is the major DCL protein required for the biogenesis of 
miRNAs that target homologous RNAs for cleavage and/or transla- 
tional repression*; DCL3 cleaves dsRNA precursors into 24-nt siRNAs, 
which are engaged in RNA-directed DNA methylation®; and DCL4 is 
required for the production of 21-nt siRNAs, directing mRNA cleav- 
age®. DCL2is thought to regulate the biogenesis of viral 22-nt siRNAs’. 
Although the roles of 21- and 24-nt siRNAs are well understood, the lack 
of endogenous 22-nt siRNAs has greatly restricted their study. Here, 
we explored the biological roles and regulatory mechanism of 22-nt 
siRNAs by using genetic mutants that produce abundant quantities 
of these molecules. 


Massive production of 22-nt siRNAs 

In Arabidopsis, bidirectional (5’-to-3’ and 3’-to-5’) decay of cytoplas- 
mic RNA, mediated by the ETHYLENE INSENSITIVE5/EXORIBONUCLE- 
ASE4 (EIN5/XRN4) and SUPER KILLER2 (SKI2) proteins, elaborately 
averts endogenous gene silencing by suppressing the accumulation of 
coding-transcript-derived siRNAs®. We began by investigating plants 
with mutations in these RNA-degrading enzymes and in DICER proteins. 
We found that simultaneous loss of cytoplasmic RNA decay and DCL4 
activity (einS dcl4 and ski2 dcl4 double mutants) led to pleiotropic 
growth disorders, including severe defects in meristem activity, leaf 
expansion and pigmentation (Fig. la, b). By contrast, ein dcl2 and ski2 
dcl2 mutants were normal. Interestingly, dcl2and a post-transcriptional 
gene silencing mutant, rdr6, rescued the defects in the einS dcl4 and 
ski2 dcl4 mutants (Fig. 1a, b), suggesting that these disorders result 
from the production of DCL2- and RDR6-dependent 22-nt siRNAs. 

By carrying out SRNA sequencing, we identified abundant 22-nt but 
not 21-nt siRNAs derived from both TRANS-ACTING SiRNA (TAS) and 
non-TAS loci in einS dcl4 and ski2dcl4 mutants (Fig. 1c and Extended Data 
Fig. 1a, b), consistent with the competing relationship between DCL4 
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Fig. 1| Disruption of cytoplasmic RNA decay and DCL4 triggers massive 
production of 22-nt siRNAs and causes growth disorders. a, b, Images of 
20-day-old plants of the indicated wild-type (Col-0) and mutant genotypes. 
Scale bars, 2cm. Three biological repeats were performed with similar results. 
c, Abundance of 21- and 22-nt siRNAs (TPM, tags per million) from all genes, 
excluding miRNAs and tasiRNAs. Data are presented as means +s.d.;n=4 


and DCL2 in substrate processing”. In contrast with 21-nt trans-acting 
siRNAs (tasiRNAs)*°, no phasing was detected in the TAS-derived 22-nt 
siRNAs (Extended Data Fig. 1c; ‘phasing’ means that the sRNAs were 
generated precisely in a head-to-tail arrangement, starting froma 
specific nucleotide). In einS dcl4 and ski2 dcl4 plants, we identified, 
respectively 1,182 and 182 protein-coding genes that produce 22-nt 
siRNAs, with 111 genes in common between the two conditions (Supple- 
mentary Tables 1, 2 and Extended Data Fig. 1d, e). Two genes encoding 
nitrate reductases, M/A1 and NIA2 (ref. '°), were particularly impor- 
tant, contributing 45% and 48% to the total amount of 22-nt siRNAs in 
einS dcl4.and ski2 dcl4 plants, respectively (Fig. ld-f). Among the top 
20 loci, SMAX1-LIKE 4 (SMXL4) and SMXLS5, which are required for 
phloem development”, also showed abundant generation of 22-nt 
siRNAs in both mutants (3% in einS dcl4 and 9% in ski2 dcl4) (Fig. 1d-f). 
From aset of HD-Z/PI/I/ transcription-factor genes (PHB, PHV, REVand 
ATHB-8), we detected abundant quantities of 22-nt siRNAs in ski2dcl4 
mutants (21% of the total amount of 22-nt siRNAs), but not in ein5 dcl4 
mutants (Fig. 1d, e and Extended Data Fig. le). 


22-nt siRNAs repress mRNA translation 


Next, we carried out transcriptome profiling to assess the effect of 
22-nt siRNAs on gene expression. We found that only asmall subset of 
the genes that generate 22-nt siRNAs showed decreased expression 
levels in einS dcl4 and ski2 dcl4 mutants (Fig. 2a, b), including SMXL4 and 
SMXLS (Fig. 2c and Extended Data Fig. 2a). The vast majority, including 
NIA1 and NIA2, showed unchanged or higher expression levels in einS 
dcl4 and ski2 dcl4 plants in both the total and the cytoplasmic MRNA 
fractions (Fig. 2c and Extended Data Fig. 2a—d). Nevertheless, NIA1 and 
NIA2 proteins were hardly detectable, even in the presence of MG132, 
an inhibitor of the 26S proteasome, arguing against the proteasomal 
degradation of the two proteins (Fig. 2d and Extended Data Fig. 2e, f). 
Meanwhile, mRNAs from polysome-associated N/A1/2 (Fig. 2e, f, 
fractions 8-11) and another two genes that produce 22-nt siRNAs, 
GLOBAL TRANSCRIPTION FACTOR GROUP E2/7 (GTE2/7) (Extended 
Data Fig. 2g), in einS dcl4 plants were lower than in wild-type or in einS 
dcl4 dcl2 plants, suggesting the translational repression of NIA1/2 
and GTE2/7 mRNAs by the 22-nt siRNAs. Notably, the total amount of 
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polysome-associated mRNAs (fractions 6-11) also decreased in einS dcl4 
and ski2dcl4 mutants; this was restored by the dc/2 mutation, implying 
global translational repression (Fig. 2f). By comparison, despite the 
similar growth defects and abundant 21-nt siRNAs*, the total amount of 
polysome-associated mRNAs (fractions 6-11) in einS ski2 mutants was 
decreased to a much lesser extent than in einS dcl4 plants (Extended 
Data Fig. 3a—d). Furthermore, polysome-associated N/AI and N/IA2 
mRNAs (fractions 6-11) in ein5 ski2 mutants were comparable to those 
in the wild type (Extended Data Fig. 3e-g). Therefore, the production 
of 22-nt, but not 21-nt, siRNAs represses translation of their cognate 
mRNAs, as well as global translation. 


AGO1 is required for 22-nt siRNA activity 


Further evidence supporting the ability of 22-nt siRNAs to repress 
translation came from the ago1-27 mutant plant. This is ahypomorphic 
allele that has normal AGOI ‘slicer’ activity but is defective in transla- 
tional repression’. When agol-27 was crossed into ein5 dcl4 and ski2 
dcl4 plants, the growth phenotype was rescued and protein levels of 
NIA1/2 were restored (Fig. 3a, b and Extended Data Fig. 4a). This effect 
was specific to agol: the ago2 and ago4 mutants could not rescue the 
growth phenotype (Extended Data Fig. 4b, c). HUA ENHANCER 1 (HEN1) 
is required for SRNA methylation”; the henJ mutant partly rescued the 
defects of both einS dcl4 and ski2dcl4 mutants (Extended Data Fig. 4d, e), 
suggesting that 22-nt siRNAs are methylated. However, ALTERED 
MERISTEM PROGRAM 1 (AMP1), which is involved in miRNA-mediated 
translational repression”, was not required for 22-nt siRNA activity: 
the amp1-30 mutation did not rescue einS dcl4 and ski2 dcl4 mutants 
(Extended Data Fig. 4f-h), suggesting that the translational repression 
mechanisms of miRNAs and 22-nt siRNAs are different. 

Given the genetic evidence that AGO] is required for 22-nt siRNA 
activity, we analysed AGO1-associated sRNAs by AGO] immunopre- 
cipitation followed by sRNA sequencing. AGOI predominantly asso- 
ciated with 21-nt miRNAs in wild-type and einS dcl4 dcl2 plants, but 
switched to binding 21-nt miRNAs and 22-nt siRNAs in einS dcl4 mutants 
(Extended Data Fig. 5a, b). The AGO1-associated 21- and 22-nt sRNAS 
showed a5’-uridine (U) preference (Extended Data Fig. 5c, d), consist- 
ent with previous findings’*. Of 642 identified genes that produced 
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AGO1-associated 22-nt siRNAs, 529 (82%) overlapped with genes that 
generate 22-nt siRNAs in einS dcl4 mutants (Extended Data Fig. Se). The 
abundance of AGO1-associated 22-nt siRNAs positively correlated with 
that of total 22-nt siRNAs in einS dcl4 plants (Extended Data Fig. 5f). As 
expected, large amounts of 22-nt siRNAs from MA1/2 were associated 
with AGO] (Fig. 3c). Therefore, 22-nt siRNAs with a 5’-terminal U can 
be selectively loaded into AGO1. 


We further confirmed the translational repression activity of 22-nt 
siRNAs using an in vitro cell-free system”. A synthesized 22-nt or 21-nt 
siRNA targeted to the same site of N/A2transcript was assembled with 
AGO1to form RISC (Extended Data Fig. 6a—d). After incubating the RISC 
with MA2 transcripts, 21-nt siRNAs showed more rapid cleavage than 
22-nt siRNAs (Fig. 3d, e), partly because of the higher AGO1-loading 
efficiency of 21-nt siRNA (Extended Data Fig. 6d). Meanwhile, 22-nt and 
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Fig. 3 |22-nt siRNAs mediate translational repression and siRNA 
amplification in an AGO1-dependent manner. a, b, Images of 20-day-old 
plants of the indicated genotypes. Scale bars, 2cm.c, IGV of AGO1-associated 
22-nt-siRNA abundance at WA1/2. IP, immunoprecipitation. d, Northern 
blotting (NB) of N/A2mRNAs labelled with 3x FLAG tag, and anti-FLAG western 
blotting (WB) of the 3x FLAG NIA2 protein at the indicated time points inthe 
presence (‘21-nt’ or ‘22-nt’) or absence (‘No siRNA’) of AGOI-RISC programmed 
with 21- or 22-nt N/A2siRNA in vitro. 18S ribosomal RNA and Ponceau S staining 
were used, respectively, as RNA and protein loading controls. Numbers tothe 
left of the SDS-PAGE gels indicate the molecular mass (in kDa) of proteins. 

e, f, Relative accumulation of full-length 3x FLAG NIA2 mRNA (e) and 3x FLAG 


NIA2 protein (f) from d. Data shown as relative amounts after normalizationto 
the mRNA level at 1 min (e) or the protein level at 120 min (f) inthe ‘no siRNA’ 
reaction (set as‘1’). g, Ratio of relative 3x FLAG NIA2 protein and mRNA levels at 
indicated time points, using the quantitative results frome, f. Values from 
Imin, 5 min and 15 min were not included because the protein level was 
undetectable at these time points. h, IGV of 22-nt siRNAs derived from NIA1/2 
genes. i,j, Heat map showing abundance of 22-nt siRNAs from1,182 and 182 
genes producing 22-nt siRNAs. The numbers of individual biological 
experiments are as follows: a, b,n=3;c,d,n=2;h,n=4. For gel source data, see 
Supplementary Fig. 1. For sequences of 3x FLAG NIA2 and AGOI, see 
Supplementary Fig. 2. 
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Fig. 4| Functional analysis and environmental induction of 22-nt siRNAs. 
a, Images of eight-day-old seedlings of the indicated genotypes grownon 
Murashige and Skoog (MS) medium. Scale bar, 0.5 cm. b, Propidium iodide 
staining of 9-day-old plant roots of the indicated genotypes. The white 
triangles point to the upper boundaries of the root apical meristem region. 
Scale bar, 50 pm. c, Abundance of 21- and 22-nt siRNAs from all genes, excluding 
miRNAs and tasiRNAs, in10-day-old Col-0 and dc/4-2 plants, grown on normal 
MS medium or nitrogen-depleted MS medium (MS-N). d, Accumulation of 
NIA1/2-generated 22-nt siRNAs in plants. e, Images of 12-day-old seedlings 
grown on indicated media. Scale bar, 5 mm. f, Relative DCL2, DCL4, SKi2and 
FINS gene-expression levels in plants. NA, not applicable. g, NIALand NIA2 
protein levels in plants. HSP90 was used asa control to normalize the protein 
level. NIALand NRratios represent the relative protein level under each 


21-nt siRNAs led to a marked decrease in NIA2 protein levels (Fig. 3d, 
f). By calculating the ratio of NIA2 protein levels to transcript levels, 
we found that 22-nt siRNAs tended to repress NIA2 protein synthesis 
more efficiently than 21-nt siRNAs (Fig. 3g), supporting their potent 
ability to repress translation. 

The finding that 22-nt miRNAs initiate phased tasiRNA production®*”” 
prompted us to examine the function of complexes containing 22-nt 
siRNAs and AGO1 in triggering transitive sSRNA amplification. Notably, 
22-nt siRNAs—including those derived from N/A1/2 and SMXL4/5—were 
greatly eliminated by ago1-27 or another hypomorphic allele, ago1- 
45 (ref. °), which similarly rescued the defects of ein dcl4 and ski2 
dcl4 mutants (Fig. 3h-j and Extended Data Fig. 7a-d). Considering the 
modest effect of ago1-45 and ago1-27 mutations on sRNA loading and 
stability”, these results indicate that AGO] function—particularly its 
ability to repress translation—is associated with efficient biogenesis 
of 22-nt siRNAs. 


22-nt siRNAs in stress adaptation 


We next investigated the biological roles of 22-nt siRNAs in regulating 
plant growth and adaptation. We found that ski2 dcl4 mutants showed 
obvious root-growth inhibition and seedling retardation on soil, par- 
tially resembling nial nia2 double mutants” (Fig. 4a, b and Extended 
Data Fig. 8a, b). Both einS dcl4 and ski2 dcl4 plants showed abnormal 
phloem development and downregulation of multiple phloem-related 
genes (CALS7 and NEN4) (Extended Data Fig. 8c, d), in line with previ- 
ous findings that SMVXL4/5 produced 22-nt siRNAs occasionally in dcl4 
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condition after normalization to NIA1or NR protein amount in Col-0 plants on 
MS medium (set as‘J1’). h, Relative N/A1/2 gene expression levels in plants. 

i,j, Polysome profiling showing the global translation efficiency of 10-day-old 
Arabidopsis plants under nitrogen deficiency condition (i) or in nial nia2 
mutant (j). The sucrose gradient absorbance (A 3,0) of fractions was monitored. 
k, Proposed model for translational repression by 22-nt siRNAs andits rolein 
plant stress adaptation. The 22-nt siRNAs accumulate in response to 
environmental stress, suchas nitrogen starvation, through the inhibition of 
RNA decay and DCL4 activities. They further cause translational repression 
and trigger transitive sRNA amplification. Data are presented as means +s.d. 
(c, f,h). The numbers of individual biological experiments are as follows: 

a,c, d, f-j,n=3;b,n=2;e,n=6. All P-values are from two-tailed Student’s ¢-tests. 
For gel source data, see Supplementary Fig. 1. 


mutants”, and that phloem formationis impaired in smx/4 smxl5 double 
mutants”. Excessive anthocyanin accumulation was observed in einS 
dcl4and ski2dcl4 plants (Extended Data Fig. 8e), consistent with previ- 
ous reports that disturbed nitrogen metabolism or defective phloem 
formation activates anthocyanin biogenesis”. All of these growth 
defects and the altered expression of phloem-related genes observed 
in einS dcl4 and ski2dcl4.can be rescued by dcl2 (Fig. 4a, band Extended 
Data Fig. 8). Furthermore, gene-ontology analysis of differentially 
expressed genes revealed that myriad plant stress-response pathways 
(including, for example, pathways involving the hormone abscisic acid 
(ABA) or the response to salt) were activated, whereas plant growth 
responses (for example, photosynthesis and cell division) were inhib- 
ited in einS dcl4 and ski2 dcl4 plants (Extended Data Fig. 9a—j and Sup- 
plementary Tables 3, 4). Together, these results suggest an important 
role of 22-nt siRNAs in fine-tuning plant growth and stress responses. 

We next explored the functionality of endogenous 22-nt siRNAs under 
environmental stress conditions. In wild-type plants, the production of 
22-nt siRNAs was noticeably induced when nitrogen was deficient; this 
induction was more evident in dcl4 mutants (Fig. 4c)—for example, in 
the case of N/A1/2-derived 22-nt siRNAs (Fig. 4d). Accordingly, wild-type 
plants displayed shortened root length, altered leaf pigmentation and 
decreased NIA1/2 protein abundance upon nitrogen deficiency, while 
dcl4 mutants showed hypersensitive responses (Fig. 4e). One possible 
explanation could be the induction of DCL2 and downregulation of 
RNA-decay genes (SK/2 and FINS) and DCL4 (Fig. 4f). It has also previ- 
ously been reported that DCL4 activity is compromised when nitrogen 
is deficient”. Notably, nitrogen depletion led to an almost complete 


loss of NIA1/2 proteins in wild-type plants (Fig. 4g), which can hardly be 
explained by the moderate accumulation of 22-nt siRNAs alone. Given 
that the MA1/2 genes are nitrate-induced”>, a substantial reduction of 
NIA1/2 expression upon nitrogen depletion might also be involved 
(Fig. 4h). Moreover, polysome profiling revealed that loss of NIA1/2 
proteins—in either nitrogen-depleted plants or nial nia2 mutants— 
resulted ina reduction of ribosome-associated mRNAs, particularly in 
fractions 1-5 (Fig. 4i,j). This global translational slowdown is probably 
due to a shortage of amino acids, the synthesis of which relies highly 
onNIA1/2- and nitrite-reductase-dependent ammonium production”®. 
Thus, we propose that under conditions of nitrogen starvation, in addi- 
tionto reduced N/A1/2expression, compromised RNA decay and DCL4 
activity cause an increase in 22-nt siRNAs and translational repres- 
sion of N/A1/2 mRNAs. This combinatory effect causes a substantial 
loss of NIA1/2 proteins and thus reduces amino-acid levels, slowing 
global translation and restraining plant growth. This is reminiscent 
of the response of yeast to nutrient deprivation, in which translation 
is inhibited by decreasing the splicing of genes that encode ribosomal 
proteins in order to enhance survival?””*. 

Meanwhile, we found that treatment with ABA also caused notable 
accumulation of 22-nt siRNAs in dc/4 plants, with 90% of them being 
derived from MA1/2, contributing to the translational repression of 
NIA1/2 (Extended Data Fig. 10a—e). Accordingly, the dcl4 mutant was 
hypersensitive to ABA treatment and displayed more severe growth 
inhibition than wild-type plants (Extended Data Fig. 10f). By search- 
ing and reanalysing previously published databases, we found that 
high salinity also led to an elevation of overall 22-nt siRNA produc- 
tion in wild-type plants, including siRNAs derived from NM/A1/2. As 
expected, salt treatment decreased the accumulation of NIA1/2 proteins 
(Extended Data Fig. 10g-i). Taken together, these data prompt us to 
propose that the production of 22-nt siRNAS, particularly from N/A1/2, 
might be essential for plants to adapt to environmental stresses. The 
efficient repression of NIA1 and NIA2 could be acommon regulatory 
mechanism when plants encounter environmental stresses that allows 
them to deliberately switch from growth to defence. 


Discussion 


Here we have reported the massive production of endogenous 22-nt 
siRNAs, particularly from the NW/A1/2 genes, in plants deficient in both 
DCL4 activity and cytoplasmic RNA decay. This increased biogenesis of 
22-nt siRNAs requires RDR6 and DCL2 activity. AGO1is required for both 
their biosynthesis and their functionality. Distinct from 21-nt and 24-nt 
siRNAs, 22-nt siRNAs are efficient in mediating translational repres- 
sion and less effective in target cleavage, and also trigger transitive 
RNA interference and silencing amplification (Fig. 4k). The induction 
of 22-nt siRNAs under certain stress conditions leads to translational 
slowdown and growth inhibition. At present, it is unknown how 22-nt 
siRNAs mediate translational repression and transitive RNA interfer- 
ence. Itis conceivable that AGO1 loaded with 22-nt siRNAs may prefer to 
stall ribosomes—a phenomenon observed for some 22-nt miRNAs”. 
A 22-nt-siRNA-AGO1 complex associated with stalled ribosomes or 
cleaved mRNA fragments would recruit RDR6 to initiate dsRNA syn- 
thesis and siRNA amplification. 
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Methods 


Nostatistical methods were used to determine sample size. The experi- 
ments were not randomized and the investigators were not blinded to 
allocation during experiments and outcome assessment. 


Plant materials and growth conditions 

All mutants used in this study were in the Col-0 background. Arabidop- 
sis thaliana mutants, including einS-1 (ref. °°), ski2-2 and ski2:3 (ref. °), 
dcl2-1 (ref. *), dcl4-2 (ref.°), rdr6-11 (ref. *), ago1-45 (ref. *°), agol-27 
(ref. *°), hen1-8 (ref. **), ein5-1 dcl4-2, ski2-2 dcl4-2, einS-1 ski2-3, ein5-1 
dcl4-2 dcl2-1, ski2-2 dcl4-2 dcl2-1, and einS-1 ski2-3 dcl42 dcl2-1 (ref. *) 
were described previously. The nial-3 (Salk_148487C) and nia2-1 
(Salk_138297C) mutants were obtained from the Arabidopsis Biologi- 
cal Resource Center (https://abrc.osu.edu/). Other double and triple 
mutants used here were generated by genetic cross. 

We used MS basal salt and nitrogen-depleted MS basal salt from 
Phyto Technology (catalogue numbers M524 and M3531) to prepare 
full-nutrition MS medium and nitrogen-depleted MS (MS-N) medium. 
Surface-sterilized seeds were plated on the media (pH 5.7) for 3 days 
at 4 °C and then kept in the greenhouse for another 6-7 days (22 °C; 
16 h/8 h photoperiod) before being transferred to soil. 


RNA-seq and sRNA-seq 

Total RNA and sRNA were extracted from the aerial part of 20-day-old 
plants (6 days on MS medium and 14 days on soil) or 10-day-old seed- 
lings on MS or MS-N medium. For mRNA sequencing (mRNA-seq), we 
extracted total RNA using the RNeasy Plant Mini Kit (Qiagen). Libraries 
were prepared using an mRNA preparation kit (Illumina), following the 
standard protocol. Sequencing of paired-end 150-base-pair mRNAs 
was conducted by BGI (Shenzhen) company using the Illumina HiSeqx 
ten platform. Raw reads were filtered by Trimmomatic (http://www. 
Usadellab.org/cms/?page=trimmomatic)* with default parameters. Fil- 
tered reads were then aligned to the Arabidopsis Col-0 genome (TAIR10 
release) using the tophat2 program” with two mismatches allowed. 
Differentially expressed genes were called by cuffdiff®”. 

For sRNA sequencing (SRNA-seq), SRNA was extracted using the 
miRNeasy Mini Kit (Qiagen). We prepared sRNA libraries using the 
NEBNext Small RNA Library Prep Set for Illumina (catalogue num- 
ber E7330L). Single-end 50-bp sRNA sequencing was conducted by 
BGI (Shenzhen) using the Illumina HiSeq 4000 platform. Raw reads 
were filtered using the fastx_toolkit (http://hannonlab.cshl.edu/ 
fastx_toolkit/). The 18-26-nt clean reads were mapped toa structural 
RNA (ribosomal RNAs, transfer RNAs, small nucleolar RNAs and small 
nuclear RNAs) database (http://rfam.xfam.org/). Then the unmapped 
reads were aligned to the TAIR10 genome using bowtie* with following 
parameters: -v 0 -a-M 3-best-strata. Differential small RNA-generating 
genes were called using the DESeq package in R (https://bioconduc- 
tor.org/packages/release/bioc/html/DESeq.html). Gene-ontology 
enrichment analysis was performed using DAVID Bioinformatics 
Resources 6.8 (ref. *’) and the BINGO package in Cytoscape software*” 
with default parameters. Mapped reads were visualized using IGV", 
version IGV_2.3.72. 

Genes generating 22-nt siRNAs in ein5-1 dcl4-2 (ed) or ski2-2 dcl4-2 
(sd) plants were identified using the following criteria: (i) 22-nt siRNA 
reads ined or sd plants should be at least twofold higher than those in 
Col-0, dcl4-2 and corresponding RNA-decay mutants (ein5-1 for ed or 
ski2-2 for sd); (ii) 22-nt siRNA reads in ed or sd plants should be at least 
twofold higher than their corresponding triple mutants generated by 
crossing with dcl2-1 (ein5-1 dcl4-2 dcl2-1 or ski2-2 dcl4-2 dcl2-1); (iii) the 
false discovery rate (FDR) of all comparisons should be less than 0.01; 
(iv) the sum of 22-nt siRNA reads in seven libraries (Col-O, ein5-1, dcl4-2, 
dcl2-1, ein5-1dcl4-2, ein5-1 dcl2-1and einS-1 dcl4-2 dcl2-1 for einS-1dcl4-2; 
Col-0, ski2-2, dcl4-2, dcl2-1, ski2-2 dcl4-2, ski2-2 dcl2-1 and ski2-2 dcl42 
dcl2-1 for ski2-2 dcl4-2) should be larger than 21. 


To calculate the abundance of 21- or 22-nt siRNAs generated fromall 
genes except for miRNAs and tasiRNAs, we added up 21- or 22-nt siRNAS 
reads from the 5’-UTR, exonic regions, intronic regions and 3’-UTRand 
normalized them to total mapped 18-26-nt siRNAs reads. To calculate 
21- or 22-nt SRNAs generated from TAS genes, we added up 21- or 22-nt 
siRNA reads from all eight annotated 7AS genes and normalized them 
to total mapped 18-26-nt siRNAs reads. 


sRNA phasing score analysis 
Phasing analysis was performed as described” using ShortStack* with 
default parameters (version 3.8.5). 


Polysome profiling 

Polysome profiling was performed as described“ with minor modi- 
fications. In brief, the aerial parts of 20-day-old (Fig. 2f and Extended 
Data Fig. 3d) or 10-day-old (Fig. 4i, j) plants were collected and ground 
into fine powder in liquid nitrogen, then homogenized in 1 ml of 
pre-cold polysome extraction buffer (PEB: 100 mM Tris-HCl, pH 9.0, 
100 mM KCI, 20 mM MgCl, 12.5 mM EGTA, pH 8.0; 75 mM NaCl, 10 mM 
Tris-HCl, pH 7.4,3 mM dithiothreitol (DTT), 100 mM sucrose, 37.5 pg mI" 
cycloheximide, 25 pg mI chloramphenicol, 6.25 p11 20% detergent mix 
(20% Brij-35, 20% Triton X-100, 20% Igepal CA630, 20% Tween 20); 25 pl 
Triton X-100). After thawing on ice for 10 min, the lysates were centri- 
fuged at 4 °C, 14,500 r.p.m. for 20 min. The supernatants were trans- 
ferred toa40-ym filter falcon tube and centrifuged at 4 °C, 14,500 r.p.m. 
for another 15 min. The supernatants were then transferred into a 
1.5-ml tube. For each sample, 700 ul of lysate was loaded on top of a 
10-50% sucrose gradient, and another 70 pl of lysate was used as input 
for total mRNA isolation using Trizol reagent. The samples were then 
centrifuged at 4 °C, 37,000 r.p.m. for 2h and 10 min ina P40ST rotor. 
Polysomal RNAs distributed along the sucrose gradient according to 
their different sedimentation coefficient after ultracentrifugation, and 
were separated using a gradient fractionator system (BioComp Instru- 
ments). Fractions containing monosomes to the heaviest polysomes 
were collected and subjected to RNA isolation using Trizol followed 
by quantitative real-time polymerase chain reaction (PCR). Polysome 
distributions of N/A1/2 (Fig. 2e and Extended Data Fig. 3e-g) and GTE2/7 
(Extended Data Fig. 2g) mRNA levels were normalized against PP2AA3 
(AT1G13320). NIA1/2 and GTE2/7 expression in each polysomal fraction 
was calculated as the percentage of its expression in total RNA. 


Total protein extraction and immunoblot assays 

Plant materials were ground into fine powder in liquid nitrogen and then 
homogenized in protein extraction buffer (100 mM Tris-HCl, pH 6.8, 4% 
SDS, 10% glycerol, 50 mM DTT). The samples were heated for 10 min at 
65 °C and then centrifuged for 10 min at 13,000 r.p.m. at 4 °C. After cen- 
trifugation, the supernatants were separated on 7.5% SDS-PAGE gels, 
and the proteins were transferred to polyvinylidene difluoride (PVDF) 
membranes (Millipore). Immunoblot assays were performed to analyse 
the samples using specific antibodies, including anti-AGO1 antibody 
(Agrisera, catalogue number AS09527), anti-HSP90 antibody (Beijing 
Protein Innovation, catalogue number AbDM51099-31-PU) and anti-NR 
antibody (Agrisera, catalogue number AS08310). Goat anti-mouse (or 
anti-rabbit) immunoglobulin G (IgG) (H+L) horseradish peroxidase 
(HRP)-conjugated secondary antibody (Promega Corporation) was 
diluted 10,000-fold. Anti-NIA1 specific antibody was made by Abclonal 
Biotechnology using the peptide KIVKETEVITTVVDSY DD. Western blot 
images were captured using a Tanon 5200 Multi chemiluminescent 
imaging system (Tanon). 


Gene-expression analysis by quantitative PCR 

Two micrograms of total RNA extracted using Trizol (Invitrogen) 
were used for reverse transcription at 42 °C for 1h with Moloney 
murine leukaemia virus (M-MLV) reverse transcriptase (Promega) 
according to the manufacturer’s instructions. After tenfold dilution 


of complementary DNA product, real-time PCR was performed using 
SYBR Premix Ex Taq II (Tli RNaseH Plus) (Takara Bio) with a Roche 
LightCycler 96 platform. PP2AA3 (AT1G13320) was used as a reference 
gene. We used 40 cycles of amplification for all genes tested herein, 
except for SMXL4 and SMXLS, which require 50 cycles owing to their 
low expression levels. Gene-specific primers used here are listed in 
Supplementary Table 5. 


Cytoplasmic-nuclear fractionation 

Arabidopsis cytoplasmic-nuclear fractionation was performed as 
described***. In brief, 20-day-old plants were collected and ground 
into a fine powder in liquid nitrogen and then homogenized in 1 ml 
lysis buffer (20 mM Tris-HCl, pH 7.5, 20 mM KCI, 2mM EDTA, 2.5 mM 
MgCl,, 25% glycerol, 250 mM sucrose, 5mM DTT, 1,000 U mI RNase 
inhibitor; 2x protease-inhibitor cocktail, 1 mM phenylmethanesul- 
fonyl fluoride (PMSF)) by gentle shaking. The homogenate was fil- 
tered through a double-layer miracloth. The filtered homogenate was 
centrifuged at 4 °C at 1,500g for 10 min to pellet the nuclei, and the 
supernatant consisting of the cytoplasmic fraction was centrifuged 
at 4 °C at 15,000g for another 10 min to further remove debris. Then 
800 ul of supernatant was collected for downstream RNA extrac- 
tion to perform reverse transcription and quantitative reverse tran- 
scription (qRT)—-PCR analysis. The nuclei pellet was washed six times 
with 1 ml NRBT buffer (20 mM Tris-HCl, pH 7.4, 25% glycerol, 2.5 mM 
MgCl, 0.2% Triton X-100 and 20 U mI RNase inhibitor). The sam- 
ple was centrifuged at 4 °C, 1,500g, for 10 min. The supernatant was 
discarded and the nuclei resuspended with 200 pl nuclei storage 
buffer (NSB: 20 mM Tris-HCl, pH 7.5, 25% glycerol, 2.5 mM MgCl, 
440 mM sucrose, and 1x protease-inhibitor cocktail). For quality 
control of cytoplasmic-nuclear fractionation, we used 100 ul of the 
total and cytoplasmic fractions and 30 ul of the nuclear fraction for 
protein extraction and immunoblot assay. We used tubulin protein 
(antibody from Abclonal, catalogue number ACO10) as a cytoplasmic 
marker and histone H3 (antibody from Abclonal, A2348) as a nuclear 
marker. 


Preparation of 3x FLAG NA2mRNA 

We constructed pBYL-3x FLAG_NIA2 by inserting the coding sequence 
of the NW/A2 gene with an amino-terminal 3x FLAG tag into the Ascl site 
of pBYL2 (ref. *”). The 3x FLAG NIA2 mRNA was in vitro transcribed from 
Notl-linearized pBYL-3x FLAG_NIA2 using an AmpliScribe T7 high yield 
transcription kit (Lucigen), followed by capping witha ScriptCap m’G 
capping system (Cell Script) and poly(A)-tailing with A-Plus poly(A) 
polymerase tailing kit (Cell Script). 


In vitro RNA silencing assay 

In vitro RNA silencing was essentially performed as described”, with 
slight modifications. In brief, 15.6 pl of BY-2 lysate, 7.8 pl of substrate 
mixture (containing ATP, ATP-regeneration system, and amino-acid 
mixture), and 1.56 pl of 300 nM AGOI mRNA were mixed, and incu- 
bated at 25 °C for 30 min. Then, 3.12 pl of 1.5 tM NIA2 siRNA duplexes 
(GeneDesign) were added to the reaction, and incubated at 25 °C for 
90 min. Subsequently, 1.5 pp of 20 nM 3x FLAG NIA2 mRNA was added. 
Ateach time point, 2-pl aliquots of the reaction mixture were trans- 
ferred into two new tubes, and used for western and northern blots. 
For detection of 3x FLAG-NIA2, anti-DDDDK-tag monoclonal antibody 
(diluted at 1/5,000) (Medical and Biological Laboratories, catalogue 
number M185-3L) was used as the primary antibody. Anti-IgG (H+L) 
(mouse) pAb-HRP (diluted at 1/5,000) (Medical and Biological Labora- 
tories) was used as secondary antibody. The 5’-P-radiolabelled DNA 
oligonucleotide mixture (5’-acctggctcgagacgagcgtattggcgattatctacag 
aggccgc-3’, 5’-ccatggcatattccttcttgatgctcgttccgtatttggaaccag 
-3’, 5’-ccttcttcgtcggcgagtt cggcgtctaccaaagaaggtaaaact-3’, 
5’-taatttggtaaaccggagttattccggttccacctgccaacattg-3’) was used for 
detection of 3x FLAG-NJA2 mRNA. 


In vitro RISC assembly assay 

The 3x FLAG-AGO1 mRNA*‘ was in vitro translated as described above. 
Then, the radiolabelled NIA2-siRNA duplex was added to the reaction 
mixture, and incubated at 25 °C for 90 min. Subsequently, the reac- 
tion mixture was incubated with Dynabeads protein G coated with 
anti-Flag M2 antibody (Sigma-Aldrich) onarotator at 4 °C for1h. Then, 
the beads were washed three times with lysis buffer (30 mM HEPES, 
pH 7.4,100 mM KOAc and 2mM Mg(OAc),) containing 1% TritonX-100 
and 800 mM NaCl. After treatment with proteinase K, the samples 
were mixed with an equal volume of 2x formamide dye”, incubated 
for 5 min at 68 °C, and loaded onto a 15% urea PAGE denaturing gel. 
After drying, the gel was analysed using PhosphorImager (Typhoon 
FLA 7000, GE Healthcare). It took 15-30 min for the in vitro system to 
produce detectable translation products once the mRNA was added; 
hence, no band was detected in the first three lanes of each condition 
in the western blot section (Fig. 3d). 


Anthocyanin content measurement 

The anthocyanin content was measured as reported*’. Samples used 
for anthocyanin extraction were preweighted (usually 0.1-0.5 g), and 
1Imlanthocyanin extraction buffer (propanol/HCI/H,0O at 18/1/81) was 
added to each sample. After overnight incubation at room tempera- 
ture inthe dark, samples were centrifuged at 14,000 r.p.m. for 20 min. 
Supernatants were diluted twofold before measuring the absorbance 
(A) at 535 nm and 650 nm, and then the relative anthocyanin content 
was calculated using the following formula: 2 x (A535 — A¢so) per gram 
of fresh weight. 


Aniline staining of callose depositions in cotyledons 

Aniline staining was performed as described™. In brief, Arabidopsis 
plants were grownon MS median for 6 days and then transferred to soil 
for another 12 days. Detached cotyledons were destained in acetic acid/ 
ethanol (1/3) buffer over night until the cotyledons were transparent; 
the saturated destaining solution was replaced during the process. Then 
the destained cotyledons were washed three times with 150 mM K,HPO,, 
eachtime 10 min. After wash, the cotyledons were incubated in 150 mM 
K,HPO, buffer containing 0.01% aniline blue (staining solution) for Sh 
ina Folcon tube wrapped in aluminium foil for light protection. After 
that, the samples were embedded in 50% glycerol for further observa- 
tion. Callose deposition was observed using a fluorescence microscope 
(Zeiss Axio Imager M2), with an excitation wavelength of 370 nm for 
aniline blue. The emission maximum for aniline blue is 509 nm. 


Measurement of root length 

For quantitative root-length measurements, images of 8-day-old 
seedlings grown on vertical MS medium were captured using a 
digital camera. Roots of 20 seedlings were measured using ImageJ 
software (https://imagej.nih.gov/ij/). Values shown are average lengths 
(means +s.d.) of 20 roots of the indicated genotypes. 


AGO1limmunoprecipitation and SRNA-seq 

Total protein was extracted from the aerial parts of 20-day-old plants 
with immunoprecipitation buffer (SO mM Tris-HCl, pH 7.5, 150 mM 
NaCl, 10% glycerol; 0.1% NP-40, 5 mM MgCl, 2x proteinase inhibitor 
cocktail). Samples were incubated with immunoprecipitation buffer at 
4 °C for 40 min with gentle rotation before centrifugation. After a first 
centrifugation at 4 °C, 13,000 r.p.m., for 15 min, the supernatants were 
transferred to a new 2-ml tube and centrifuged at 4 °C, 13,000 r.p.m., 
for another 25 min. Then the extracts were incubated with AGO1 
antibody for 2 h at 4 °C with rotation. Pierce protein A/G magnetic 
beads (Thermo Scientific) were added and incubated for another 2h 
at 4 °C with rotation. Beads were collected with a DynaMag-2 magnet 
(Thermo Scientific) and washed six times with wash buffer (50 mM 
Tris-HCl, pH 7.5, 150 mM NaCl, 10% glycerol, 0.5% NP-40;5 mM MgCl, 
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1x proteinase-inhibitor cocktail), each time for 5 min. Then sRNAs were 
extracted from immunoprecipitates using 800 ul Trizol reagent, pre- 
cipitated with a twofold volume of ethanol, and subjected to sSRNA 
library construction using a NEBNext small RNA library prep set for 
Illumina (NEB, catalogue number E7330L). Libraries were sequenced 
with the Illumina Novaseq 6000 platform under paired-end, 150-bp 
conditions; two biological replicates were performed for each geno- 
type. Read 1 of each paired-end read was used for further analysis as 
in the sRNA-seq analysis pipeline. Sequence-conservation analysis of 
AGO1-associated sRNAs was performed using Weblogo 3 with default 
parameters (http://weblogo.threeplusone.com). 


Propidium iodide staining of Arabidopsis roots 

Roots of 9-day-old Arabidopsis seedlings were stained with 1 pg mI 
propidium iodide solution (stock solution 1 mg ml; Thermo Scien- 
tific, P3566) for approximately 5 min on a slide. A coverslip was then 
placed on the top for confocal microscopy (Zeiss LSM880) with 40x 
water immersion. The excitation wavelength and emission maximum 
of propidium iodide are, respectively, 536 nm and 617 nm. We used 14 
(Col-O and ski2-2) or 15 (dcl4-2, ski2-2 dcl42 and ski2-2 dcl42 dcl2-1) cap- 
tured root images to calculate the number of meristematic cortex cells. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Sequencing data are available at the NCBI Gene Expression Omnibus 
(GEO; https://www.ncbi.nIm.nih.gov/geo/) under accession number 
GSE136164. Source gel data for immunoblots and radiograms (Figs. 2-4 
and Extended Data Figs. 2, 4, 6,10) are provided in Supplementary Fig. 1; 
source data for all graphs (Figs. 1-4 and Extended Data Figs. 1-3, 5, 7-10) 
are also provided and are available with the online version of the paper. 
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Extended Data Fig. 1| Differential expression analysis of siRNAs in ein5-1 
dcl4-2and ski2-2dcl4-2 plants. a, Normalized tasiRNA abundance in each 
genotype of 20-day-old plants. TPM, tags per million. Data are presented as 
means +s.d.;n=4 biologically independent samples. b, IGV of 21- and 22-nt 
siRNA abundanceat eight TAS gene loci.c, 21- and 22-nt SRNA phasing score of 


tasiRNAs from representative TASIa and TASIc lociin the indicated genotypes. 


The y-axis shows the sRNA phasing score. See Methods for calling of sRNA 


phasing scores. d, Venn diagram depicting 22-nt-siRNA-generating genes that 
overlap between einS-] dcl4-2 (1,182 genes) and ski2-2 dcl4-2 (182 genes) plants. 
e, The top 20 gene loci and their FDRs in einS-1 dcl4-2 (left, ed) and ski2-2 dcl4-2 
(right, sd) plants, ranked by 22-nt siRNA abundance in each double mutant. 
Genes inred produce 22-nt siRNAs in both einS-1 dcl4-2 and ski2-2 dcl4-2 plants. 
c, Col-0; ed, einS-1 dcl4-2; sd, ski2-2 dcl4-2. FDR values are from Benjamini- 
Hochberg analysis; n =4 biologically independent samples. 
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Extended Data Fig. 2 | N/A1and NJA2mRNA and protein levels in ein5-1dcl4- 
2and ski2-2dcl4-2 plants, and translational states of GTE2/7in ein5-1dcl4-2 
plants. a, mRNA-seq read abundance of N/A1/2 and SMXL4/5 genes inthe 
indicated genotypes. b, Relative expression levels of N/A1/2 detected by qRT- 
PCRat 15:00 and 18:00 in 20-day-old Col-0, ein5-1 dcl4-2, ski2-2 dcl4-2 and einS-1 
ski2-3 plants. c, Separation of cytoplasmic and nuclear fractions of Col-0, ein5-1 
dcl4-2 (ed) and einS-1 dcl4-2 dcl2-1 (edd) plants. Tubulin and histone H3 proteins 
were used as cytoplasmic and nuclear markers, respectively. Cyto, cytoplasm; 
Nuc, nucleus. d, Relative expression level of N/AJ and N/A2 genes detected by 
qRT-PCR at 18:00 in the total and cytoplasmic fractions of 20-day-old plants. 
e, Verification of anti-NIA1and anti-NR antibodies (the latter recognizing both 


NIALand NIA2) using nia1-3 and nia2-1 null alleles. HSP90 was used asa loading 
control. f, Protein levels of NIA1/2 detected by western blot at 15:00 and 18:00 
in 20-day-old plants; sd, ski2-2 dcl4-2; sdd, ski2-2 dcl4-2 dcl2-1; ed, einS-1 dcl4-2; 
edd, einS-1 dcl4-2 dcl2-1; es, einS-1 ski2-3; esdd, einS-1 ski2-3 dcl4-2 dcl2-1. HSP90 
was used asa loading control. g, GTE2/7 mRNA levels were normalized against 
PP2AA3 levels (locus AT1G13320). GTE2/7 expression in each polysomal 
fraction was calculated as the percentage of its expression in total RNA. Data 
are shownas means + s.d. (b, d) or means with individual data points (technical 
replicates) (g). Numbers to the right of gels in c-findicate the molecular mass 
(in kDa) of proteins. Numbers of individual biological experiments are: a, n=4; 
b-d,n=3;e-g,n=2.For gelsource data, see Supplementary Fig. 1. 
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Extended Data Fig. 3 | Accumulation ofsiRNAsand polysomeprofilesofthe (at 260nm) was monitored in different fractions. Experiments were repeated 
einS-1ski2-3 mutant. a, Representative images of 20-day-old Col-0,ein5-1ski2- — twice withsimilar results. e-g, Polysome distribution of W/A1/2transcripts, 


3and einS-1 ski2-3 dcl4-2 dcl2-1 plants. Scale bar, 2cm. Experiments were with numbers on x-axis corresponding to fractions shown ind. N/A1/2mMRNA 
repeated three times with similar results. b, Normalized abundance of 21- and levels were normalized against PP2AA3 (AT1G13320). Ine, f, NIA1/2 expression 
22-nt siRNAs from all genes, excluding miRNAs and tasiRNAs,ineachgenotype. ineachpolysomal fraction was calculated as the percentage of its expressionin 
Datashownas means +s.d.;n=4 biological independent samples. c, IGV of total RNA. Ing, TUB2was used as acontrol. Data in e-g are presented as means 
21- and 22-nt siRNAs from representative N/A1/2 gene loci in 20-day-old plants and individual data points (technical replicates). Experiments were repeated 
of each genotype. d, Polysome profiling of global translation efficiency in twice with similar results. 
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protein levels at 15:00 in 20-day-old plants of the indicated genotypes. HSP90 to the right of gels indicate the molecular mass (in kDa) of proteins. The 
was used as acontrol. b-g, Genetic interaction of einS-1 dcl4-2 and ski2-2dcl4-2 numbers of individual biological experiments are as follows: a, h, n=2; 
with ago2-1 (b,c), ago4-1 (b, c), hen1-8 (d, e) and amp1-30 (f, g). Representative b-g,n=3.For gel source data, see Supplementary Fig. 1. 
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Extended Data Fig. 6| Scheme for RNA silencing by 21- and 22-nt siRNAS 

in vitro. a, Scheme underlying the experimental procedure of Fig. 3d. 

See Methods for further details. b, Structures of 21- and 22-nt siNIA2 siRNAS. 
The guide strands shown in red were radiolabelled. The grey strands showthe 
passenger strand. The 5’-ends of siRNAs were phosphorylated and the 3’-ends 
were methylated. c, Schematic of 3x FLAG NA2mRNA and siNIA2 siRNA target 
site. d, We incubated 1.5 pM radiolabelled NIA2-siRNA duplexes (21- and 22-nt) 
within vitro translated 3x FLAG-AGO1 in BY-2 lysate at 25 °C for 90 min. 
AGO1-RISC was then isolated using anti-FLAG antibody. Input (1/20) and 
AGO1-associated (IP (FLAG)) 21- and 22-nt siRNAs were then separated on 
denaturing gels and analysed by autoradiograph. Experiments were repeated 
twice with similar results. For gel source data, see Supplementary Fig. 1. 
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Experiments were repeated three times with similar results. c, IGV of 22-nt plants) genes that produce 22-nt siRNAs in the indicated genotypes. 
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Extended Data Fig. 8 | Root growth and phloem formation are impaired in 
einS-1dcl4-2 and ski2-2dcl4-2 plants. a, Root length of 8-day-old plants of 
indicated genotypes. Data are means +s.d.;n=20 biologically independent 
roots. b, Number of meristematic cortex cells in 9-day-old plants. Dataare 
means +s.d.;n=14 (for Col-O and ski2-2) and 15 (for dcl4-2, ski2-2 dcl4-2 and ski2- 
2dcl4-2 dcl2-1) biologically independent roots. c, Cotyledons from 18-day-old 
plants shown under bright field or with aniline-based callose visualization. 
Scale bars, 500 pm. n=S biologically independent cotyledons with similar 


results. See Methods for further details. d, Relative expression of 
phloem-related genes (CALS7 and NEN4) in the indicated genotypes of 
20-day-old plants. Data are means +s.d.;n=3 biologically independent 
experiments. e, Measurement of relative anthocyanin content (AA/g.FW: 
absorbance at 525nm minus absorbance at 650 nm per gram of fresh weight) in 
20-day-old plants of the indicated genotypes. Data are means +s.d.;n=5 
biologically independent samples. P-values ina, b, dare from two-tailed 
Student’s t-tests. 
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Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 | Transcriptome profiles and gene-ontology analysis 
of differentially expressed genes in einS-1dcl4-2 and ski2-2 dcl4-2 plants. 
a,c, Venn diagrams depicting numbers of DCL2-dependent differentially 
expressed genes (at least twofold changes, FDR < 0.01) in einS-1 dcl4-2 (a) and 
ski2-2 dcl4-2(c) plants. b, d, Pie charts representing the numbers of upregulated 
and downregulated genes in einS-1 dcl4-2 (b) and ski2-2 dcl4-2 (d) plants. 

e-h, Representative gene-ontology enrichment analysis of upregulated (red) 
and downregulated (blue) genes in einS-1 dcl4-2 (e, f) and ski2-2 dcl4-2 (g, h) 
plants. Circle sizes represent gene numbers and colour gradients indicate 


enrichment significance. Numbers at the bottom are the ratios of genes for 
each indicated biological process inthe total differentially expressed genes. 

i, Heatmap depicting the log, (fold change) of representative genes involved in 
ABA biogenesis and response in 20-day-old Col-0 (c) versus ein5-1 dcl4-2 

(ed), einS-1 dcl4-2 dcl2-1 (edd), ski2-2 dcl4-2 (sd) and ski2-2 dcl4-2 (sdd) plants. 

j, qRT-PCR analysis of an ABA-responsive gene (RD29B) in 20-day-old plants of 
the indicated genotypes; data shown are means +s.d. FDR values used to filter 
the differentially expressed genes ina-h were obtained by Benjamini- 
Hochberg analysis; n =3 biologically independent samples. 
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Extended Data Fig. 10 | ABA and salt treatments induce the production of 
22-nt siRNAs from MA1 and NIA2loci.a, Normalized abundance of 21- and 
22-nt siRNAs from all genes, excluding miRNAs and tasiRNAs, ineach 
condition. b, Accumulation of 22-nt siRNAs from N/AI/2 gene loci in14-day-old 
plants under1pM ABA treatment. c, Pie chart showing percentages of loci 
generating 22-nt siRNAs under 1M ABA treatment. d, Relative W/A1/2mMRNA 
expression level at 15:00 in 14-day-old plants on MS medium or 1}4MABA 
medium.e, NIA1/2 protein levels at 15:00 in 14-day-old plants on MS medium or 
1M ABA medium. HSP90 was used as a control to normalize the protein level. 
f, Representative images of 14-day-old seedlings grown on the indicated 
medium. Scale bar, 5 mm. g, Normalized abundance of 21- and 22-nt siRNAs 
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from all genes, excluding miRNAs and tasiRNAs, in15-day-old Col-0 plants 
under 250 mM NaCl treatment (data retrieved from GEO, accession number 
GSE66599). All P-values are from two-tailed Student’s t-tests. h, Normalized 
abundance of 22-nt siRNAs from M/A1 and N/A2 genes under the same 
conditions asing).i, NIA1/2 protein level detected by western blot in 8-day-old 
wild-type seedlings upon 250 mM NaCl treatment for 24 h. HSP90 was used asa 
control. e, i, Numbers to the right of gels indicate the molecular mass (in kDa) of 
proteins. Data are presented as means +s.d.ina,d,g,h. The numbers of 
individual biological experiments are as follows: a, d, f,n=3;e,n=2;i,n=4.For 
gel source data, see Supplementary Fig. 1. 
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n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


O A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


Oo For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Triax v1.44A(polysome profiling); 
Tanon 5200 (Western Blot assay); 
Typhoon FLA 7000 (PhosphorImaging); 
ImageJ 1.46r(Root length measurement and Western blot/Northern blot quantification); 
LightCycler 96 v1.1.0.1320 (qRT-PCR assay); 
Zen Black (Confocol assay); 
Zen 2.0 Pro (Phloem observation). 


Data analysis We fully described the software availability in the manuscript as: 
For mRNA-seq: Raw reads were filtered by Trimmomatic (v0.38) with default parameter. Then filtered reads were aligned to the 


Arabidopsis Col-O genome (TAIR10 release) using tophat2 (v2.1.1) program with 2 mismatches allowed. The differential expressed genes 
were called by cuffdiff (v2.2.1). 


For sRNA-seq: Raw reads were filtered by fastx_toolkit (v0.0.14) . The 18-26 nt clean reads were mapped to structural RNA database . 
Then the unmapped reads were aligned to TAIR10 genome using bowtie (v1.2.2) program with following parameters: -v O -a -M 3 --best 
--strata. Differential small RNA-generating genes were called by DESeq package in R. GO enrichment analysis was performed in DAVID 
Bioinformatics Resources 6.8 and the BINGO package in Cytoscape v3.4.0 software with default parameters. Mapped reads were 
visualized in the Integrated Genome Viewer, version IGV_2.3.72; ShortStack software (v3.8.5) were used for small RNA phase analysis. 


Weblogo 3.6.0 was used for small RNA sequence conservation analysis; 


Graphpad Prism 8.0.2 was used for graphs data presentation; 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


Sequencing data are available to public at the NCBI Gene Expression Omnibus (GEO) repository under the accession number GSE136164. Source gel data for 
immunoblots and radiograms (Fig. 2-4, Extended data Fig. 2,4, 6,10) are supplied as Supplementary Figure 1; Source data for all Graphs (Fig.1-4, Extended data 
Fig.1-3,5,7-10) are also provided and are available with the online version of the paper. All other datasets generated within this study available from the 
corresponding author upon reasonable request. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No statistical methods were used to predetermine sample sizes. Required experimental sample sizes were estimated based on previous 
established protocols in the field. The sample sizes were adequate as the experimental results were reproducible. 


Data exclusions No data were excluded from the analysis. 


Replication All experimental findings were reproduced in several independent biological experiments (N) with multiple technical replicates. The number 
of repeats is indicated in the figure legends. Main Conclusions were confirmed in different assays (Immunoblot assay, genetic assay, In vitro 
cleavage assay) handled by multiple researchers and across different labs. 


Randomization Samples of the same genotypes were randomly collected and pooled for downstream experiments. 


Blinding Blinding was not possible as the author who performed the experiments also analyzed the data. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 
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Antibodies 


Antibodies used anti-NIA1(Provided by ABClonal Biotechnology, Cat. WG-03170, dilution 1:2,000 ); anti-NR(Provided by Agrisera, Cat. ASO8 310, 
dilution 1:2,000); anti-AGO1(Provided by Agrisera, Cat. ASO9 527, dilution 1:300 for AGO1-IP); anti- HSP90(Provided by Beijing 
Protein Innovation, Cat. AbM51099-31-PU, dilution 1:400,000); anti-Tubulin(Provided by Abclonal, Cat. ACO10, dilution 1:5,000); 
anti-Histone H3(Provided by Abclonal, Cat. A2348,dilution 1:5,000). 


Validation 1. anti-NIA1 antibody made by this study was validated in Arabidopsis for Western blot and the validated data was presented in 
the manuscript Extended Data Fig.2e; 
Concerning other commercial antibody specificity, we kindly refer to the supplier's websites and datasheets to find statements 
on specificity and citations for the use of the antibodies: 


2. The commercial anti-NR antibody has been well validated in Arabidopsis for Western Blot in the several published papers and 
in our manuscript (Extended data fig.2e), and the antibody profile can be found in the following link: https://www.agrisera.com/ 
en/artiklar/nr-nitrate-reductase-assimilatory.html; 

3. The commercial anti-AGO1 antibody has been well validated in Arabidopsis for Western blot and Small RNA-IP sequencing in 
the published papers, and the antibody profile can be found in the following link: https://www.agrisera.com/en/artiklar/ago1- 
argonaute-1.html; 
4. The commercial anti- HSP90 antibody has been well validated in Arabidopsis for Western blot, and the antibody profile can be 
found in the following link: http://www.proteomics.org.cn/nav/108.html. 

5. The commercial anti-Tubulin antibody has been well validated in Arabidopsis for Western blot in the published papers,, and 
the antibody profile can be found in the following link: https://abclonal.com/instructions/pdf/ACO10; 

6. The commercial anti-Histone H3 antibody has been well validated in Arabidopsis for Western blot in the published papers,, 
and the antibody profile can be found in the following link: https://abclonal.com/instructions/pdf/A2348; 
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Vaccines may reduce the burden of antimicrobial resistance, in part by preventing 
infections for which treatment often includes the use of antibiotics’ *. However, the 
effects of vaccination on antibiotic consumption remain poorly understood— 
especially in low- and middle-income countries (LMICs), where the burden of 


antimicrobial resistance is greatest®. Here we show that vaccines that have recently 
been implemented in the World Health Organization’s Expanded Programme on 
Immunization reduce antibiotic consumption substantially among children under 
five years of age in LMICs. By analysing data from large-scale studies of households, we 
estimate that pneumococcal conjugate vaccines and live attenuated rotavirus 
vaccines confer 19.7% (95% confidence interval, 3.4-43.4%) and 11.4% (4.0-18.6%) 
protection against antibiotic-treated episodes of acute respiratory infection and 
diarrhoea, respectively, in age groups that experience the greatest disease burden 
attributable to the vaccine-targeted pathogens®’. Under current coverage levels, 
pneumococcal and rotavirus vaccines prevent 23.8 million and 13.6 million episodes 
of antibiotic-treated illness, respectively, among children under five years of age in 
LMICs each year. Direct protection resulting from the achievement of universal 
coverage targets for these vaccines could prevent an additional 40.0 million episodes 
of antibiotic-treated illness. This evidence supports the prioritization of vaccines 
within the global strategy to combat antimicrobial resistance’. 


Antimicrobial resistance (AMR) poses a substantial and growing threat 
to global health and economic well-being’. Populations of LMICs suffer 
the greatest morbidity and mortality associated with AMR, and stand 
to be disproportionately affected by projected increases inthis burden 
over future decades”. Antibiotic use is a key risk factor for human colo- 
nization or infection with drug-resistant organisms at scales spanning 
individuals, communities and countries”. Increasing rates of antibiotic 
consumption inLMICs” have highlighted tension between the need to 
ensure access to life-saving antibiotics globally, and the need to curb 
resistance selection”. Although it has been widely hypothesized that 
vaccines may reduce AMR burden by preventing infections that drive 
antibiotic consumption’, few real-world studies provide evidence of 
this effect. 

Acute respiratory infection (ARI) and diarrhoea are the leading causes 
of antibiotic use among children in LMICs”. Because few diagnostic 
tools exist to guide management of ARI and diarrhoea, antibiotic treat- 
ment of these conditions is generally informed by suspicion rather than 
confirmation of disease aetiology. Pneumococcal conjugate vaccines 
(PCVs) against 10 and 13 serotypes of Streptococcus pneumoniae and 
live attenuated rotavirus vaccines target the predominant causes of 
ARI and diarrhoea among children, respectively°’’. These vaccines 
have recently been introduced into routine paediatric immunization 
programmes of countries across the globe”. Although appropriate 


for children experiencing pneumococcal ARI, antibiotic treatment is 
not warranted for rotavirus-attributable diarrhoea. To date, no stud- 
ies have addressed the impact of PCV10/13 and rotavirus vaccines on 
antibiotic use in LMICs°. 

We analysed data from demographic and health surveys to esti- 
mate: (i) the incidence of antibiotic treatment for ARI and diarrhoea 
among children in LMICs; (ii) the effectiveness of PCV10/13 and 
rotavirus vaccines in preventing antibiotic use associated with ARI 
and diarrhoea among children in these settings; and (iii) the pro- 
portion of antibiotic-treated ARI and diarrhoea cases attributable 
to vaccine-serotype S. pneumoniae and rotavirus, respectively. We 
used these results to quantify the incidence of antibiotic consump- 
tion among children in LMICs prevented by current uses of PCV10/13 
and rotavirus vaccines, and the additional consumption that could be 
averted by achieving universal vaccine coverage targets’. 


Vaccine effects against antibiotic use 


We first assessed whether PCV10/13 and rotavirus vaccines are effective 
in preventing antibiotic-treated illnesses among children in LMICs. 
We undertook a case-control study comparing the odds of previous 
vaccination among children under five years of age reported to have 
experienced (cases) or not to have experienced (controls) ARI and 
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diarrhoea in the two weeks preceding the standardized household 
surveys. End points for our analysis were (i) all cases of disease, (ii) cases 
for which medical care or advice was sought, and (iii) cases receiving 
antibiotic treatment. Data were available on 65,815 children from sur- 
veys conducted from 2015 onward in Afghanistan, Angola, Armenia, 
Burundi, Ethiopia, Haiti, Lao People’s Democratic Republic, Malawi, 
Nepal, Pakistan, the Philippines, Senegal, Sierra Leone, South Africa, 
Tajikistan, the United Republic of Tanzania, Uganda and Zimbabwe 
(Supplementary Tables 1, 2; earlier surveys did not collect children’s his- 
tory of PVC10/13 and rotavirus vaccination). To eliminate confounding 
effects driven by any association between the access of children to vac- 
cines and the likelihood of disease or antibiotic treatment, we matched 
cases and asymptomatic controls from the same countries for age 
and survey date (each within one month), urbanicity, within-country 
wealth quintile and receipt of pentavalent vaccine doses (a potential 
confounder of ARI analyses, and a marker of vaccine and/or healthcare 
access). We also conducted planned subgroup analyses for children 
aged 0-23 months and aged 24-59 months due to well-documented 
differences in ARI and diarrhoea aetiology across age groups®”. 

Children who received at least three PCV10/13 doses experienced 8.7% 
lower odds of antibiotic-treated ARI than unvaccinated children (Fig. 1 
and Supplementary Table 3), although the 95% confidence interval for 
this estimate included the possibility of no effect (-1.3-19.7% reduction 
(values in parentheses indicate 95% confidence intervals throughout); 
two-sided P=0.079). The estimated reduction was driven primarily by the 
prevention of cases in children aged 24-59 months; within this age group, 
vaccination was associated with a 19.7% (3.4-43.4%) reduction in cases of 
antibiotic-treated ARI. By contrast, effects of PCV10/13 were not clearly evi- 
dent during the first 2 years of life, when less than 5% of cases of ARI are attrib- 
utable to vaccine-serotype pneumococci’. To test that our findings are the 
result of vaccine-conferred protection and not confounding factors, wealso 
estimated PCV10/13 effectiveness against diarrhoea and antibiotic-treated 
diarrhoea. Although these outcomes would not be expected to be causally 
related to PCV10/13 receipt, their association with previous vaccination 
may be prone to similar confounding pathways. We did not find strong 
evidence of associations between negative-control outcomes and receipt 
of PCV10/13, validating our finding that PCV10/13-conferred protection 
reduces the incidence of antibiotic-treated ARI. 

Similarly, children who received a full rotavirus vaccine series (at 
least two doses for countries included in this study) experienced 
8.1% (2.8-14.1%) lower odds of antibiotic-treated diarrhoea at ages of 
0-59 months than their unvaccinated counterparts (Fig. 1 and Sup- 
plementary Tables 4, 5). During the first 2 years of life, we estimated 
11.4% (4.0-18.6%) effectiveness of rotavirus vaccination against 
antibiotic-treated diarrhoea. Protection was not evident in children 
aged 24-59 months, when rotavirus is a less common cause of diar- 
rhoeaamong childreninLMICs°. The absence of associations between 
rotavirus vaccination and negative-control ARI end points provided 
validation for our finding that vaccine-conferred protection against 
rotavirus reduces the risk to children of antibiotic-treated diarrhoea. 


Analysing aetiological fractions using vaccine effects 


We next developed a simple model using the vaccine effective- 
ness estimates described above to investigate the proportions of 
antibiotic-treated ARI and diarrhoea attributable to vaccine-serotype 
pneumococciand rotavirus, respectively (Methods). The model took 
these inputs together with pooled estimates of vaccine efficacy against 
disease caused by the targeted pathogens based on meta-analyses 
of previous studies (Supplementary Table 7). Because rotavirus vac- 
cine efficacy beyond the second year of life is uncertain (and likely 
to be low)’””°, and because we did not identify strong evidence 
of vaccine-conferred protection against antibiotic-treated diar- 
rhoea in children aged 24-59 months, we limited our assessment of 
rotavirus-attributable antibiotic use to children aged 0-23 months. 
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Fig. 1| Effectiveness of pneumococcal and rotavirus vaccines against illness 
and antibiotic treatment. a-f, The estimated effectiveness against ARI and 
diarrhoea end points of PCV10/13 (a-c) and rotavirus vaccines (d-f) for all 
cases (a, d), cases for which treatment or advice was sought (b, e) and cases that 
were treated with antibiotics (c, f). Estimates were calculated as one minus the 
matched odds ratio and are shown as vaccine effectiveness. Analyses matched 
children with each end point to asymptomatic controls on the basis of country, 
age (within 1 month), visit timing (within 1 month), wealth quintile 
(country-specific), urbanicity and pentavalent vaccine doses received. The 
population available for analysis included 5,342 ARI cases (of whom 3,294 
sought treatment or advice and 1,913 received antibiotics) and 57,856 controls 
without ARI; and 9,944 diarrhoea cases (of whom 7,382 sought treatment or 
advice and 1,437 received antibiotics) and 40,059 controls without diarrhoea 
(Supplementary Tables 1, 2). Points and lines indicate median estimates and 
95% confidence intervals, respectively. We estimated vaccine effectiveness 
against negative-control end points (PCV10/13 effect against diarrhoea; 
rotavirus vaccine effect against ARI) to assess residual confounding asa 
validation step. PCV10/13 exposure was defined as >3 doses received. Because 
all countries in this analysis used Rotarix in their national immunization 
program, we defined >2 doses asa full rotavirus vaccination series. Numerical 
estimates can be found in Supplementary Tables 3-5. Quantiles are estimated 
through 2,000 independent draws from the distribution of estimates. 


We estimated that vaccine-serotype pneumococci would account 
for 10.9% (-1.6-25.4%) of antibiotic-treated ARI cases among children 
under five years old in PCV10/13-naive LMIC populations (Fig. 2and Sup- 
plementary Table 8); the negative lower limit of this confidence interval 
resulted from propagation of uncertainty in the estimated effect of 
PCV10/13 against antibiotic-treated ARI in children aged 0-59 months, 
and reflects the possibility that vaccine-targeted pneumococcal sero- 
types are not a significant cause of antibiotic consumption. Among 
children aged 24-59 months, we estimated that vaccine-serotype 
pneumococci cause 24.8% (4.3-54.6%) of antibiotic-treated ARI cases. 

We similarly estimated that rotavirus would cause 21.6% (2.1-40.3%) 
of antibiotic-treated diarrhoea among children aged 0-23 months 
in unvaccinated LMIC populations. Consistent with the findings of 
diarrhoea aetiology studies undertaken across differing LMICs°”, we 
obtained similar estimates of the rotavirus-attributable fraction in 
analyses stratified by the income status of each country, in which we 
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Fig. 2| Estimates of the attributable fraction for vaccine-preventable 
infections. a, b, We illustrate estimates of pathogen-specific attributable 
fractions of vaccine-serotype pneumococci for children aged 0-59 months 
(a, left) and 24-59 months (a, right), and rotavirus for children aged 

0-23 months (b).c, We also illustrate distributions of vaccine efficacy 
estimates against infections involving the vaccine-targeted organism, as 
estimated using meta-analyses (Supplementary Table 7); these estimates 
provided a basis for computing the attributable fraction (Supplementary 
Tables 1, 2). We considered PCV efficacy against vaccine-serotype invasive 
pneumococcal disease for the primary analysis (left). Asecondary analysis 
(Supplementary Table 8) used PCV efficacy against culture-confirmed 
pneumococcal vaccine-serotype acute otitis media. For rotavirus (right), we 
stratified estimates of human monovalent rotavirus vaccine (Rotarix) efficacy 
against rotavirus gastroenteritis in children aged 0-23 months in 
middle-income countries and low-income countries to account for the 
differential efficacy in these settings, and obtained a pooled estimate of the 
attributable fraction weighted by the number of children residing in 
middle-income countries and low-income countries (Methods). VE, vaccine 
effectiveness. Points and lines indicate median estimates and 95% confidence 
intervals, respectively. Quantiles are obtained through 2,000 independent 
draws from the distribution of estimates. 


accounted for variation in rotavirus vaccine efficacy across settings 
(Methods and Supplementary Tables 7, 8). We estimated that 21.3% 
(0.0-42.4%) and 22.7% (4.5-60.6%) of antibiotic-treated diarrhoea 
cases would be attributable to rotavirus among unvaccinated children 
in middle-income and low-income countries, respectively. 


Antibiotic use in LMICs 


To understand these estimates of vaccine effectiveness and 
pathogen-attributable fractions in terms of absolute antibiotic con- 
sumption, we estimated the incidence of ARI-related antibiotic use 
among children aged 24-59 months, and of diarrhoea-related anti- 
biotic use among children aged 0-23 months. We analysed data from 
household surveys covering 944,173 children across 77 countries from 
2006 to 2018 (Methods). We used the resulting estimates to extrapo- 
late country-level incidence rates for 58 additional countries without 
recent household surveys, aided by data on 405 national-level health, 
socioeconomic and demographic economic indicators from the World 
Bank Open Data catalogue (https://data.worldbank.org/) (Supplemen- 
tary Tables 9, 10). 

Estimated incidence rates of all cases and antibiotic-treated cases 
of ARI and diarrhoea were the highest in the poorest countries, 
and the lowest in the wealthiest countries (Fig. 3). Our estimates of 
country-specific ARI incidence in children aged 24-59 months ranged 
from 89.5 to 194.8 episodes per 100 children annually across all LMICs 
(Fig. 3 and Supplementary Table 11; we present further estimates of 
antibiotic-treated ARI in children aged 0-59 months in Supplementary 
Table 12). For children aged 24-59 months, the estimated proportion of 
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Fig. 3 | Estimated incidence across countries of ARI and diarrheal illnesses 
per 100 children. a, b, We estimate country-specific incidence of ARI and 
antibiotic-treated ARI in children aged 24-59 months (a) and diarrhoea and 
antibiotic-treated diarrhoea in children aged 0-23 months (b). Points and lines 
indicate median estimates for an individual country together with 
accompanying 95% confidence intervals. Estimates are obtained from analyses 
of Demographic Health Surveys and Multiple Indicator Cluster Surveys, 
comprising 944,173 children across 77 countries, as well as extrapolations 
based on 405 health, nutrition and population indicators for all LMICs. Points 
plotted in white are extrapolated from estimates based on household survey 
data (see Methods). Estimates for individual countries are providedin 
Supplementary Tables 11-13. Quantiles are obtained through5,000 
independent draws from the distribution of estimates. 


episodes treated with antibiotics ranged from 30.0% to 69.4%, yielding 
incidence rates of antibiotic-treated ARI of between 31.2 and 107.3 cases 
per 100 children annually. For diarrhoea in children aged 0-23 months, 
we estimated that country-specific incidence rates ranged from 297.5 
to 658.4 episodes per 100 children annually. Accounting for antibiotic 
treatment in an estimated 7.7-37.7% of diarrhoea episodes, annual rates 
of antibiotic-treated diarrhoea ranged from 24.6 to 164.5 episodes per 
100 children. Country-specific estimates are included in Supplemen- 
tary Tables 11-19. 


Pathogen-attributable antibiotic use 
Multiplying these incidence rate estimates by pathogen-specific attrib- 
utable fractions provides the burden of antibiotic-treated ARI and diar- 
rhoea targeted by PCV10/13 and rotavirus vaccines. In the absence of 
vaccination, we estimated that vaccine-serotype pneumococci would 
cause 15.9 (2.8-34.7) antibiotic-treated ARI episodes per 100 children 
aged 24-59 months annually in LMICs, translating to a total of 56.9 
(9.9-123.8) million antibiotic-treated ARI episodes (Fig. 4 and Supple- 
mentary Table 20). We estimated that 11.7 (2.0-24.8) million episodes 
occur annually in countries in which PCV10/13 has not been imple- 
mented. Analyses comprising all children aged 0-59 months residing 
in LMICs yielded similar burden estimates (Supplementary Table 21). 
We estimated that rotavirus would cause 19.7 (4.6-36.8) 
antibiotic-treated diarrhoea episodes annually per 100 children aged 
0-23 months in LMICs in the absence of vaccination, resulting ina 
total of 47.9 (11.1-89.5) million antibiotic-treated diarrhoea episodes 
(Fig. 4 and Supplementary Table 22). Of this burden, we estimated that 
20.4 (3.4-38.7) million episodes occur annually in settings without 
rotavirus vaccination. 


Vaccine-preventable antibiotic use 

The ‘direct’ protection conferred by the immunological effects of vac- 
cines on their recipients provides alower-bound estimate of the impact 
achievable by immunization programmes, as it excludes ‘indirect’ 
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Fig. 4 | Total vaccine-preventable antibiotic consumption and incidence per 
100 children. a,b, We estimated the incidence and total number of 
antibiotic-treated ARI and diarrhoea episodes attributable to 
PCV10/13-serotype pneumococci in children aged 24-59 months (a) and 
rotavirus in children aged 0-23 months (b), respectively. Left, incidenceinthe 
absence of vaccination. Right, the corresponding total number of cases, the 
total number of cases under 2018 vaccine coverage levels and under universal 
vaccine coverage. Estimates were stratified by income status (low income; 
lower middle income; upper middle income). Points indicate median estimates, 
with superimposed lines indicating 95% confidence intervals; violin plots 
illustrate the distribution around estimated incidence and total cases. 
Numerical estimates are provided inSupplementary Tables 20-22, 24-26. 
Quantiles are obtained through 5,000 independent draws fromthe 
distribution of estimates. 


protection resulting from reduced transmission of the vaccine-targeted 
pathogens”. As of 2018, an estimated 66.8% of age-eligible children 
in countries with PCV programmes in place received a full series of at 
least three PCV10/13 doses” (Supplementary Table 23). Under these 
coverage levels, we estimate that direct effects of PCV10/13 prevent 
23.8 (4.2-52.0) million episodes of antibiotic-treated ARI annually 
among children aged 24-59 months in LMICs (Fig. 4 and Supplemen- 
tary Tables 24, 25). This effect represents 42.4% (32.7-47.5%) of all 
antibiotic-treated ARI episodes that vaccine-serotype pneumococci 
would be expected to cause among children aged 24-59 months in 
LMICs, inthe absence of PCV10/13 use. We estimated that an additional 
21.7 (3.8-47.5) million episodes of antibiotic-treated ARI could be pre- 
vented by the direct effects of vaccination if PCV10/13 coverage were 
expanded to reachall children aged 24-59 months in LMICs. This resid- 
ual burden corresponds to 38.7% (30.0-43.4%) of all antibiotic-treated 
ARI episodes that we estimated to be attributable to vaccine-serotype 
pneumococci among children aged 24-59 months in LMICs. 

Roughly 77.3% of age-eligible children in LMICs in which rotavirus vac- 
cination has been implemented received a full rotavirus vaccine series 
as of 2018”. We estimated that direct effects of rotavirus vaccination 
currently prevent 13.6 (3.6-23.7) million episodes of antibiotic-treated 
diarrhoea annually among children aged 0-23 months in LMICs 
(Supplementary Table 26). This effect reflects only 31.0% (17.7-35.2%) 
of all antibiotic-treated diarrhoea episodes that we would expect rota- 
virus to cause annually among children aged 0-23 months who live in 
LMICs, inthe absence of vaccination. We estimated that an additional 


18.3 (4.2-32.6) million episodes of antibiotic-treated diarrhoea could 
be prevented annually by the direct effects of rotavirus vaccination 
among children aged 0-23 months under a scenario of universal vac- 
cine coverage, representing 42.1% (14.6% to 50.7%) of all antibiotic use 
that we estimated to be attributable to rotavirus in this population. 


Discussion 


We found that vaccines against S. pneumoniae and rotavirus reduce 
antibiotic consumption among children in LMICs. In the absence of 
these vaccines, we estimated that PCV10/13-serotype pneumococci 
and rotavirus would cause 56.9 million and 47.9 million episodes of 
antibiotic-treated ARI and diarrhoea, respectively, among the age 
groups at greatest risk for these infections. Current PCV10/13 and 
rotavirus immunization programmes prevent an estimated 37.4 mil- 
lion episodes of antibiotic-treated illnesses annually through direct 
vaccine-conferred protection among children under five years of age 
in LMICs. Within this population, an additional 40.0 million episodes 
canbe prevented through improvements in vaccine coverage in settings 
in which PCV10/13 and rotavirus vaccines are currently implemented, 
and through the introduction of these vaccines in countries in which 
children currently do not receive them. 

Itis helpful to consider the effects that we estimate alongside those of 
other interventions aiming to reduce unnecessary antibiotic consump- 
tion. Infrastructural improvements that address water, sanitation and 
hygiene have been prioritized in LMIC action plans to combat AMR®”. 
However, randomized trials reporting suboptimal effects of such inter- 
ventions on disease incidence” suggest that these approaches may 
present limited opportunities to prevent antibiotic-treated illness. 
Although certain antimicrobial stewardship interventions are feasible 
in LMICs”, the need to tailor such interventions to country-specific 
contexts hinders the development, adoption and assessment of univer- 
sal strategies”®. Recent aetiological studies reveal that most antibiotic 
courses administered to cases of ARI and diarrhoea among childrenin 
LMICs are unnecessary or inappropriate’”’, but few diagnostic tools 
exist to inform the clinical management of these infections. This short- 
age of alternative approaches to optimize antibiotic treatment in LMICs 
underscores the potential value of vaccination as a component of 
strategies to reduce AMR. 

We used the recent introduction of PCV10/13 and rotavirus vaccines 
into paediatric immunization programmes as a natural means to esti- 
mate antibiotic use attributable to the vaccine-targeted pathogens. 
Strengths of our study include the opportunity to estimate the direct 
effects of the vaccines and disease burden using individual-level data 
from consistently designed surveys; statistical power for the estimation 
of small effect sizes in our case-control study of more than 60,000 chil- 
dren across 16 countries; and our ability to validate vaccine effective- 
ness estimates against negative-control end points. However, certain 
limitations should be considered. First, we relied on mother-reported 
disease outcomes and antibiotic use rather than a standardized clinical 
assessment. Although an imperfect data source”’, maternal reports 
have been validated for the measurement of acute illness, care-seeking 
andantibiotic receipt among children in LMICs in previous studies»”? >). 
Risk factor data in household surveys provided an opportunity to cor- 
rect for potential differences in reporting thresholds for these condi- 
tions across settings—a limitation of previous burden studies based 
on survey data***’, Second, our vaccine-analysis method permitted 
us to estimate pathogen-attributable antibiotic use only within the 
strata for which we had high-quality estimates of vaccine efficacy, as 
well as evidence of effect of the vaccines in our case-control study. For 
this reason, our findings centre on the age groups at greatest risk for 
pneumococcal ARI and rotavirus diarrhoea, among all children under 
five years of age who live in LMICs®”. Finally, our focus on the direct 
effects of the vaccines, which we were able to estimate through a case-— 
control study, may lead to underestimation of the population-level 
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impact of vaccination programmes. Indirect effects of PCV10/13 and 
rotavirus vaccines have also been reported in LMICs in which these 
vaccines have been implemented in routine paediatric immunization 
programmes***”, 

Several further considerations inform the interpretation of the 
effects of PCV10/13 on antibiotic consumption. Acute otitis media, 
sinusitis and other upper respiratory infections that are potentially 
attributable to pneumococci account for a substantial share of antibi- 
otic use among infants within high-income settings*®. These conditions 
may not be captured with high sensitivity by the ARI definition used in 
household surveys*”””. As a result, our use of an ARI end point is likely 
to underestimate the antibiotic consumption that is preventable by 
PCV10/13. By contrast, replacement of vaccine-targeted pneumococcal 
serotypes by non-vaccine serotypes may reduce the long-term effect 
of PCV10/13 on various pneumococcal disease end points”. However, 
the association of vaccine-targeted serotypes with conditions such 
as pneumonia and otitis media**’, which drive antibiotic consump- 
tion, suggests a limited capacity of replacement serotypes to offset 
the effects of PCV10/13 on this consumption (Extended Data Fig. 1). In 
addition, the reported extent of vaccine-driven serotype replacementin 
pneumococcal carriage and disease has remained lower in LMICs**** 
than in high-income settings, for reasons that remain incompletely 
understood*. 

Emerging recognition of the threat posed by AMR has led to global, 
regional and national action plans to address the increasing burden 
of resistant disease®*” ’. A lack of evidence on the effectiveness of 
strategies advocated in these plans—which span public education 
efforts, human and veterinary antimicrobial stewardship, sanitation 
and other interventions—constrains the ability of policymakers to 
allocate resources to approaches with the greatest effect. Our study 
demonstrates that PCV10/13 and rotavirus vaccines reduce antibi- 
otic consumption among children in LMICs, and that a substantial 
volume of antibiotic consumption may be prevented by introducing 
and increasing the global coverage of these vaccines. This evidence 
supports the prioritization of childhood vaccines as part of a global 
strategy to combat AMR. 
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Methods 


Household survey data 

We used data from Demographic Health Surveys (DHS; rounds V-VII; 
https://www.dhsprogram.com) and Multiple Indicator Cluster Sur- 
veys (MICS; rounds 5, 6; http://mics.unicef.org/surveys) undertaken 
in various LMICs from 2006 to 2018. Design of the two surveys was 
coordinated to facilitate data comparison and joint analyses across 
countries and over time. In brief, a two-stage probability sampling 
approach was used to select geographical clusters within countries 
or provinces of countries, and to select households within clusters. 
Survey questionnaires addressed household composition as well as 
risk factors, health outcomes and healthcare utilization among house- 
hold occupants; original survey instruments can be viewed at https:// 
dhsprogram.com/publications/publication-DHSG4-DHS-Question- 
naires-and-Manuals.cfm (for DHS data) and https://mics.unicef.org/ 
surveys (for MICS data). 

Mothers were asked to answer survey questions on behalf of children 
under five years of age. Health information collected about children 
included history of ARI symptoms (defined as short, rapid breathing 
or difficult breathing that was chest-related or occurring with cough) 
and diarrhoea (defined as the occurrence of three loose stools ina 24-h 
period) inthe 2 weeks preceding the survey. Mothers were asked about 
care seeking at public-sector or private-sector hospitals or clinics, phar- 
macies or doctor’s offices, and receipt of drugs including antibiotic 
pills, syrups or injections, antimalarials, and analgesics or antipyret- 
ics. For diarrhoea, mothers were also asked about children’s receipt 
of antimotility drugs, intravenous feeding, zinc and oral rehydration. 
Vaccination status (including antigens, doses and date of receipt) was 
collected from vaccination cards. Round VII of DHS and round 6 of MICS 
were the first to include PCV10/13 and rotavirus vaccination; thus, the 
case-control study (detailed below) included data from DHS round VII 
and MICS round 6 only. 


Case-control study 

We estimated PCV10/13 effectiveness against ARI-related end points 
and rotavirus vaccine effectiveness against diarrhoea-related end 
points in an individually matched, case-control study to compare 
counterfactual outcomes among individuals encountering similar 
conditions of transmission, as well as other exposures (the matching 
procedure is described below). End points were: occurrence of ARI 
and diarrhoea symptoms; occurrence of ARI and diarrhoea symptoms 
for which medical treatment or advice was sought (from a source 
other than atraditional practitioner); and occurrence of ARI and diar- 
rhoea symptoms for which antibiotic pills, syrups or injections were 
received. For diarrhoea analyses, we also considered an end point of 
diarrhoea symptoms resulting in oral rehydration using pre-packaged 
salts or fluid, or recommended homemade fluids or intravenous solu- 
tion, among children who sought care (results are included in Sup- 
plementary Tables 4, 5). For the purposes of our analyses, children 
experiencing each end point in the preceding 2 weeks were defined 
as ‘cases’, and those who did not report symptoms were defined as 
controls. 

Within each country, we matched cases and controls onthe basis of 
age and visit timing (each within one month), within-country wealth 
quintile (a variable defined on the basis of country-specific house- 
hold socioeconomic indicators), urbanicity (residence within a sam- 
pling cluster defined as urban or rural) and pentavalent vaccine doses 
received (to reduce confounding by healthcare access, including access 
to vaccination, and to control for any effects of diphtheria, pertussis and 
Haemophilus influenzae type b antigens on ARI). Healthcare utilization 
questions for the ARI end point did not distinguish whether care was 
received due to respiratory symptoms or due to fever. Therefore, to 
separate vaccine effects from antibiotic treatment associated with ARI 
symptoms (rather than treatment of fever cases that would be expected 


to occur independent of ARI symptoms, for example, due to malaria, 
typhoid or arboviruses), we also matched children on mother-reported 
fever and excluded those receiving antimalarials. 

We matched cases and controls without replacement at a 1:3 
ratio and undertook statistical inference in a resampling frame- 
work, generating 2,000 independent draws from the permuta- 
tion distribution of case-control match assignments. Within each 
iteration, we estimated the matched odds ratio (OR,,) using con- 
ditional logistic regression. Defining Y=1 and Y= 0 as case and 
control status, respectively, and V=1 and V=0 as vaccinated and 
unvaccinated status, respectively, we took 1— OR, (VIY) to measure 
vaccine effectiveness. 


Assessment of residual confounding 

The relatively recent implementation of PCV10/13 and rotavirus vac- 
cines inthe immunization programmes of countries may make penta- 
valent vaccine an imperfect surrogate of the access of children to these 
newer products. In particular, within-country geographical differences 
in access to PCV10/13 and rotavirus vaccines may introduce residual 
confounding if the differences in vaccine access are associated with 
geographical variation in disease burden. 

Data from DHS and MICS include the subnational administrative 
regions in which children reside, although this variable must be inter- 
preted with caution as the size and uniformity of subnational regions 
varies across countries (between 4 and 3,399 children were available 
from each region (median, 217 children), before accounting for match 
eligibility). Although it was not possible to match children at precise 
subnational resolutions for our case-control study, we used subna- 
tional administrative region designators to assess risk for residual 
confounding by geography. We computed the proportion of vari- 
ance in ARI and diarrhoea outcomes, and in PCV10/13 and rotavirus 
vaccine receipt, explained by subnational region after accounting 
for all other matching factors included in our analysis. Specifically, 
we assessed the difference in the summed squared error of residuals 
from regression models treating disease end points and vaccina- 
tion as outcome variables, which either included or did not include 
subnational administrative units as intercepts after controlling for 
age and visit timing (each as monthly factor variables), urbanicity, 
wealth quintile, mother-reported fever (only for analyses of ARI and 
PCV10/13) and pentavalent vaccine doses received. Low estimates 
of the proportion of variance explained by subnational region in 
either the exposure or outcome suggest minimal risk of confound- 
ing by geography. The results of this analysis are included in Sup- 
plementary Table 6. In addition, to assess whether negative-control 
analyses provided a sufficient opportunity to correct for residual 
confounding associated with access to these newer vaccines, we 
estimated the correlation in receipt of full PCV10/13 and rotavirus 
vaccine series among age-eligible children in countries where both 
vaccines were recommended. We describe the results of these analy- 
ses in the Supplementary Information section ‘Assessment of residual 
confounding’. 


Vaccine probe study 

We formulated a simple model that enables the estimation of the 
proportion of episodes attributable to vaccine-targeted pathogens 
for each end point (for example, ARIsymptoms or antibiotic-treated 
ARI symptoms) in the absence of vaccination. We defined p as the 
risk for a child to experience a given end point attributable to the 
vaccine-targeted pathogen in the 2-week recall period, w as the risk 
of the same end point attributable to all other causes and 0 as the 
relative risk of disease caused by the vaccine-targeted pathogen ina 
vaccinated individual relative to an unvaccinated individual, owing 
only to vaccine-conferred protection (such that 1 — 6 measures vac- 
cine efficacy against illness caused by the targeted pathogen). Under 
this framework 


Pr(V=1, Y=1)Pr(V=0, Y=0) 
Pr(V=0, Y=1)Pr(V=1, Y=0) 
[6p + w)[1-p- a] 
[1- 9p - a] [p+ a] 
_9pto , 1-p-@ 
pt+o 1-0p-a@ 


OR, (VIY) = 


(1) 


1-p-@ 
1-6p-@ 


Noting that =1forsmallp anda 


Op +o 


ORWVIN = 


(2) 


or in other words, we expected the matched odds ratio to provide a 
reasonable estimator for the relative hazard of each end point, given 
vaccination. Using equation (2), above 


_ @(1-OR4(VIY)) 
~~ OR, (VIY) -0 Ga) 


and 
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The above approximation relies on the assumption that OR,,(V|Y) is 
similar to the relative risk of disease, given vaccination. We illustrate 
in Extended Data Fig. 2 the potential degree of bias under differing 
values of 6, p and w that are concordant with our results and those of 
external studies assessing all-cause burden of ARI and diarrhoea among 
children in LMICs**°*, The degree of bias approaches zero under any 
of the following conditions: (i) the all-cause disease end point is rare 
(that is, p + w ~ O); (ii) the attributable fraction is low (p « @); or (iii) 
the vaccine is highly efficacious against disease due to the pathogen 
of interest (8 =O). 


Estimation of the fraction attributable to rotavirus 
Because previous studies have reported differential efficacy and effec- 
tiveness of rotavirus vaccines in socioeconomically distinct settings, 
we generated stratified estimates of OR,(V|Y) (in the case-control 
study) and 6 (in the meta-analysis, as described below) for children 
aged 0-23 months residing in middle-income and low-income coun- 
tries. We used these inputs to generate separate rotavirus-attributable 
fraction estimates associated with each end point for middle-income 
and low-income countries. We reconstructed the distribution of the 
rotavirus-attributable fraction for each end point across all LMIC set- 
tings (Fig. 2and Supplementary Table 8) using the weighted average 
of paired draws from the independent distributions of the attribut- 
able fraction estimates of middle-income and low-income countries; 
weights were proportional to the total populations of children aged 
0-23 months residing in middle-income and low-income countries. 
To account for differential direct effects of the vaccine against 
rotavirus-attributable diarrhoea end points in middle-income and 
low-income countries, we used our stratified estimates of rotavirus 
vaccine efficacy, effectiveness and attributable fractions for all burden 
assessments. 


Meta-analysis of vaccine efficacy studies 

We obtained estimates of 6 for PCV10/13 and rotavirus vaccine using 
a meta-analysis of vaccine efficacy studies. Because @ has not been 
estimated for the effects of PCV10/13 against vaccine-serotype ARI, we 
took PCV efficacy against invasive disease caused by vaccine-serotype 
pneumococci asa primary estimate of protection, and PCV efficacy 
against acute otitis media caused by vaccine-serotype pneumococci as 


alower-bound estimate, consistent with previous studies”. Estimates 
of PCV efficacy against these end points were aggregated from two 
previously published reviews**>*. We used estimates of efficacy against 
invasive pneumococcal disease from only low- and middle-income 
countries. Because such estimates were not available for PCV efficacy 
against acute otitis media, we did not restrict estimates for this end 
point on the basis of study setting. 

For rotavirus, we used estimates of 6 from studies that estimated the 
efficacy of the human pentavalent rotavirus vaccine (Rotarix) against 
rotavirus gastroenteritis in the first two years of life in LMICs, again 
aggregated ina previous systematic review. Data were restricted to 
Rotarix studies because this was the only vaccine used in the national 
immunization programmes of countries included in the case-control 
study. 

For both end points, we estimated pooled values of 8 using inverse 
variance-weighted random effect models using log-transformed effect 
size estimates from each study. 


Risk factor analysis of household survey data 

We used a regression-based approach to estimate country-level rates 
of incidence of ARI and diarrhoea in the year 2016 in the absence of 
PCV10/13 and rotavirus vaccine use. Because previous studies have 
described marked variation in the clinical threshold for maternal or 
caregiver report of ARI and diarrhoea”****?*°, we aimed to reconstruct 
incidence rates on the basis of risk factor prevalence, controlling for 
region-level intercepts that arise from differential reporting of symp- 
toms. We built Poisson regression models for the outcome of report- 
ing of syndrome Z (signifying all-cause ARI or diarrhoea) for child iof 
the form 


E(ZX)) = Maw * €XP ‘ +) BX eit | (4) 
k 


where a represents a general intercept for the log-transformed inci- 
dence rate. Each variable X, indicates the exposure of childito the Ath 
risk factor for syndrome Z, associated with a £,-fold increase in inci- 
dence; the ¢; errors are independent and identically distributed with 
mean zero among all children. The term 7, indicates the probability 
of reporting syndrome Zgiven it truly occurred among childreninthe 
same region as child i. We extracted the risk factors listed in Supple- 
mentary Table 27 on the basis of their inclusion in both DHS and MICS 
datasets, and based on previous evidence of their relevance to the risk 
of ARI and/or diarrhoea for children*!>”**. In addition to variables 
included in DHS and MICS, we extracted the gross domestic product 
(GDP) per capita of each country in the survey year and in 2016. We 
included covariates in the model identified to have a statistically sig- 
nificant (P< 0.05) unadjusted association with the outcome variable 
that persisted in multivariable analyses, and tested for significant 
pairwise interactions among covariates. Risk factor estimates are shown 
in Extended Data Fig. 3 and Supplementary Tables 28-30 for ARI (for 
children aged 24-59 months and 0-59 months) and diarrhoea (for 
children aged 0-23 months). 

We similarly modelled the probability (H) for ARI and diarrhoea to be 
treated with antibiotics using a Poisson regression model of the form 


E(HX;)) = exp c + Xei+ a| 2) 
k 


We evaluated the same risk factors for inclusion as in our analyses of 
ARI and diarrhoea; here we defined 7;as mean-zero independent and 
identically distributed error terms. The individual risk factor estimates 
are shown in Extended Data Figs. 4,5 and Supplementary Tables 31-33 
for antibiotic-treated ARI and diarrhoea. 

We limited analyses of ARI and diarrhoea end points to children 
who had received no PCV10/13 doses and no rotavirus vaccine doses, 
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respectively. For DHS rounds V-VI and MICS round 5, which did not 
include PCV10/13 and rotavirus vaccine data collection, children were 
assumed not to have received PCV10/13 or rotavirus vaccine doses 
if countries had not yet implemented these vaccines in their immu- 
nization programmes at the time of the survey. Surveys undertaken 
in countries that had implemented PCV10/13 or rotavirus vaccines 
were excluded if information on the receipt of these vaccines among 
children was not collected. 

Missing values in the outcome and risk factor variables were popu- 
lated in five independent pseudo-datasets by multiple imputation 
using the Amelia Il package in R°’; in total, 11.4% of data cells were 
missing at the outset of analyses, including 0.6% and 2.4% of ARI and 
antibiotic-treated ARI observations, and 0.1% and 0.4% of diarrhoea and 
antibiotic-treated diarrhoea observations. Additional measures” of 
country-level antibiotic access (Supplementary Table 27) were included 
duetoscientific interest and a consideration that these covariates may 
aid imputation. 


Assumption of a Poisson outcome distribution 

Translating the reported history of any ARI or diarrhoea in the pre- 
ceding 2 weeks to a Poisson-distributed number of ARI or diarrhoea 
episodes required two assumptions: that episodes are acute (consist- 
ent with the nature of illness solicited by DHS and MICS household 
surveys) and that overall incidence rates are low (resulting in alow or 
negligible likelihood of =2 distinct episodes occurring ina single fort- 
nightly period in whicha child was reported to have experienced ARI or 
diarrhoea)*°. The low frequency of diarrhoea lasting 14 days or longer 
ina previous cohort study in LMICs” (accounting for 4.9% and 1.8% of 
all diarrhoea episodes in the first and second years of life, respectively) 
indicated that our assumption of an acute nature of mother-reported 
illness was appropriate for the diarrhoea outcome. Similarly, previous 
studies of respiratory infections among childrenin LMICs have reported 
short average durations of new-onset disease episodes, with 87% to 
100% of episodes lasting <2 weeks® ™. For incidence rates similar to 
those estimated in our study (approximately 4 diarrhoea cases and 
1.25 ARI cases per child per year), assuming a 7-day average duration 
of illness and a definition of 3 symptom-free days to distinguish sepa- 
rate disease episodes, the assumption of exponentially distributed 
inter-event times (concordant with Poisson-distributed counts) would 
yield asecond new-onset diarrhoea case in only 4.3% of 2-week report- 
ing periods witha first case identified, and a second new-onset ARI case 
in only 1.4% of reporting periods witha first case identified. This small 
bias, if applicable to our analyses, would be expected to lead to slight 
underestimation of true burden. 


Incidence rate estimation 

From the fitted models, we generated standardized estimates of the 
hazard rates of ARI and diarrhoea for individual children for the year 
2016. To correct for differential reporting, we took the lowest and high- 
est estimated reporting probability from any region (7g) to provide 
lower and upper bounds, respectively, on true incidence rates. 

We sampled from the multivariate normal distribution of 
(log-transformed) regression parameter estimates and model residuals, 
thus accounting for uncertainty in the effects of risk factors on event 
rates. To propagate uncertainty in our analyses driven by sampling vari- 
ability in the surveyed population as well as our multiple imputation 
procedure, we repeated this analysis 1,000 times for each of 5 multiply 
imputed datasets. At each iteration, we resampled children accord- 
ing to survey weights before sampling ARI and diarrhoea outcomes 
stochastically based on individual-level risk factors. On the basis of 
the estimated incidence rate of ARI and diarrhoea of each child from 
these risk factors, we sampled a total number of annual cases assuming 
this value would be Poisson-distributed with respect to the underlying 
individual-specific rate. We used the same approach to sample the 
antibiotic treatment status of each case based on model-generated 


estimates of the probability of antibiotic treatment, given individual 
risk factors, under an assumed Bernoulli outcome distribution. 


Extrapolation of burden estimates to countries without 
household survey data 

For the 58 LMICs not covered by DHS rounds V-VII or MICS 
rounds 5 and 6, we extrapolated rates of incidence of all cases 
and antibiotic-treated cases of ARI and diarrhoea on the basis of 
national-level variables aggregated as Health, Nutrition and Population 
Statistics by the World Bank (https://databank.worldbank.org/source/ 
health-nutrition-and-population-statistics). For each of 5,000 draws 
from the distribution of country-specific incidence-rate and treatment 
probability estimates (1,000 draws each from 5 multiply imputed data- 
sets), we fitted regression tree models using tenfold cross-validation 
to arandomly sampled 90% set of countries via stochastic gradient 
boosting using the caret package in R with default tuning parameters™. 
We saved predictions for the 10% ‘holdout’ set of countries to assess 
model performance (Extended Data Fig. 6). We resampled the 90/10 
set of training and holdout countries at each iteration. We used fitted 
models to generate out-of-sample predictions for countries without 
DHS and MICS data. 


Estimating the vaccine-preventable disease burden 

We multiplied independent draws from the distribution of our 
estimates of the proportion of antibiotic-treated ARI episodes and 
antibiotic-treated diarrhoea episodes attributable to vaccine-serotype 
pneumococci and rotavirus, respectively, by independent draws 
from the distribution of our estimates of country-level incidence 
rates for antibiotic-treated ARI (for children aged 24-59 months and 
0-59 months) and antibiotic-treated diarrhoea (for children aged 
0-23 months). We used our stratified estimates of rotavirus-attributable 
fractions (for low-income and middle-income countries) for all analy- 
ses of diarrhoea burden end points. For analyses estimating vaccine 
direct effects, we multiplied independent draws from the distribution 
of our estimates of PCV10/13 direct effects against antibiotic-treated 
ARI and rotavirus vaccine direct effects against antibiotic-treated diar- 
rhoea, as obtained in the case-control study, by independent draws 
from the distribution of our estimates of country-level incidence 
rates for antibiotic-treated ARI (of children aged 24-59 months and 
0-59 months) and antibiotic-treated diarrhoea (in children aged 
0-23 months). 

To generate estimates of burden across multiple countries (glob- 
ally, or grouped by income strata), we estimated total cases in each 
country by multiplying independent draws of the incidence rate for 
each end point by the number of children in each age group (using 
World Bank estimates; https://databank.worldbank.org/home). For 
each sampled parameterization (defined by estimates of vaccine 
effectiveness, pathogen-attributable burden and incidence rates), 
we summed total cases across countries belonging to a stratum of 
interest and estimated incidence by dividing total summed cases by 
the population at risk. 


Potential reduction in PCV10/13 effects owing to serotype 
replacement 

Replacement of vaccine-targeted pneumococcal serotypes by nonvac- 
cine serotypes may partially offset the estimated effect of PCV10/13 on 
all-cause ARI end points. We assessed the extent to which this may alter 
our estimates by determining the maximal increase in nonvaccine-type 
antibiotic-treated ARI that could be expected under scenarios consist- 
ent with reported serotype replacement in pneumococcal carriage in 
LMIC settings. 

We defined p,;(t) and pyy7(t) as the prevalence of carriage of 
vaccine-type (VT) pneumococci and nonvaccine-type (NVT) pneumo- 
cocci at time t,andr,and ry,as the rates of progression of vaccine-type 
and nonvaccine-type pneumococci from carriage to ARI*”. We used 


previous estimates of the relative incidence of vaccine-type and 
nonvaccine-type invasive pneumococcal disease per carrier®™, and of 
vaccine-type and nonvaccine-type otitis media per carrier®, to sup- 
ply bounds on nj; and ryy; for ARI, as this quantity has not previously 
been measured. We used data from three studies of pneumococcal 
carriage among children less than five years old before and after vac- 
cine introduction in settings with long-term, continuous, prospective 
surveillance in place”**+ to supply pre-vaccination (p,;(0) and pyy(0)) 
and post-vaccination (p,;(1) and pyy7(1)) estimates of carriage preva- 
lence. We took [pyy7(1) — Pyyr(O) Iryyr to indicate the excess incidence 
of disease attributable to nonvaccine serotypes that would result from 
post-vaccination replacement of vaccine-targeted serotypes by non- 
vaccine serotypes. 

We compared this excess post-vaccination incidence attributable 
to serotype replacement with two measures of pre-vaccination dis- 
ease incidence. The ratio [pyy;(1) — Pyyr(O) Ifyyi/[Pyr(O)v1] compared 
the excess replacement-associated incidence of nonvaccine-serotype 
disease to the incidence of vaccine-serotype disease targetable by 
vaccination with PCV10/13. Second, defining AF as the fraction of 
all-cause disease incidence attributable to vaccine-type pneumococci, 
the ratio [Pyyr(1) — Pyyr(O) IryyrAF/[py1(O0)ry7] compared the extent of 
replacement-driven nonvaccine-serotype disease incidence to all-cause 
incidence in the pre-vaccination era. These estimates are shown in 
Extended Data Fig. 1, using the attributable fraction estimates for 
all-cause antibiotic-treated ARI (as this was the primary end point of 
interest in our analyses). 

In light of the assumptions informing our interpretation of 
PCV10/13 effectiveness against non-specific ARI end points in the 
case-control study, we consider the estimates provided by this analy- 
sis to represent an upper bound onthe potential replacement-driven 
disease burden. Our estimates of vaccine-serotype pneumococcal 
attributable fractions based on the case-control study assumed no 
effect of PCV10/13 on disease not attributable to vaccine-serotype 
pneumococci. However, children receiving PCV10/13 in prelicensure 
or early-implementation studies in fact have been reported to experi- 
ence higher risk of carriage of nonvaccine serotypes” and resulting 
nonvaccine-type disease®*”. As such, odds ratio estimates in our 
case-control study may be interpretable as representing the ‘net’ 
reduction in disease risk among vaccinated children, accounting for 
both protection against vaccine-type pneumococci and increased 
acquisition of nonvaccine types. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


MICS data are publicly available at http://mics.unicef.org/. DHS data 
are available upon request from https://dhsprogram.com/Data/. Files 
posted in the GitHub repository (http://github.com/joelewnard) 
include the reduced versions of the DHS and MICS datasets neces- 
sary to replicate analyses. 


Code availability 

The analysis code necessary to replicate all parts of the analysis and for 
the generation of the figures is available from GitHub (http://github. 
com/joelewnard). 
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Extended Data Fig. 1| Potential effect of pneumococcal serotype 
replacement. a, We compare the maximum estimate of the replacement 
disease associated with increased carriage of nonvaccine-type (NVT) 
pneumococcito the prevented burden of disease associated with vaccine-type 
(VT) pneumococci based on the approach described in the Methods section 
‘Potential reduction in PCV10/13 effects owing to serotype replacement’. We 
plot the ratio of replacement-attributable NVT disease as a function of the 
relative pathogenicity of NVT and VT pneumococci. As this ratio may differ 
according to disease end point, the measure presented inacan beinterpreted 
as end point agnostic. Estimates from two previous studies ® of acute otitis 
media (AOM) and invasive pneumococcal disease (IPD) provide a range of 
0.218-0.387 for the relative pathogenicity of NVT compared with VT 
pneumococciin these conditions. We plot estimates based on serotype 


AOM rel. disease potential 
(Lewnard et al., 2017) 


IPD rel. disease potential 
(Shouval et al., 2009) 
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0.444 (Usuf et al., 2019) 
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(Heinsbroek et al., 2018) 


Ratio of replacement NVT burden to 
pre-vaccination ARI burden (all causes) 


Relative disease potential 
of NVT (ref. VT) 


replacement observations from three carriage studies in LMICs with 
continuous, prospective surveillance in place before and after PCV10/13 
implementation®**”. b, We next illustrate the estimated ratio of 
replacement-attributable antibiotic-treated ARI to all-cause antibiotic-treated 
ARI among children aged 24-59 months, based on our estimates of the fraction 
of antibiotic-treated ARI attributable to VT pneumococci (Fig. 2). On the basis 
of the input values from the carriage and disease studies cited*** °°", we infer 
that the maximum extent of antibiotic-treated ARI that replacement serotypes 
would account for would decrease to between 2.6% and 8.2% of the 
pre-vaccination incidence of all-cause antibiotic-treated ARI; this range is well 
below the estimated reduction associated with protection against VT 
pneumococci. 
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Extended Data Fig. 2| Bias that occurs when using the odds ratioto 
approximate the relative risk. a—d, We illustrate the degree of bias that 
occurs in attributable fraction estimates (indicated by the departure of the 
estimated proportion from the 1:1 diagonal) under differing parameterizations 
with respect to the true aetiological fraction and vaccine efficacy against the 
targeted infection. Values correspond to our meta-analytic estimates of PCV 
efficacy against vaccine-serotype invasive pneumococcal disease (a), PCV 
efficacy against vaccine-serotype acute otitis media (b), rotavirus vaccine 
efficacy against rotavirus gastroenteritis in middle-income countries (c) and 
rotavirus vaccine efficacy against rotavirus gastroenteritis in low-income 
countries (d). The range of 15-30% is highlighted in grey as plausible values for 


True proportion 


the proportion of disease attributable to vaccine-serotype pneumococci for 
children aged 24-59 months and rotavirus for children aged 0-23 months, 
based on previously published studies®’. Values are plotted ona blue-to-red 
ramp corresponding toincreases insymptom prevalence (p + @); values of 0.01, 
0.02, 0.03, 0.05 and 0.1correspond to incidence rates of 52,104,156, 261and 
521 episodes per 100 children annually (roughly the range of our all-cause ARI 
and diarrhoea incidence rate estimates); under the assumption of a3-day 
duration of symptoms, the same prevalence values correspond to incidence 
rates of 122 to1,217 episodes per 100 children annually. These outcomes 
suggest negligible bias in aetiological fractions over the range of plausible 
values for our analysis. 
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Extended Data Fig. 3 | Fitted association of improved water and sanitation 
access with diarrhoea risk. a—c, We illustrate model-estimated rates of the 
incidence (Inc.) per 100 children of diarrhoea under differing conditions of 
access to improved water and sanitation conditions among at ages 

0-23 months. Estimates account for interactions of sanitation and water access 
with GDP per capita. Shaded regions indicate 95% confidence intervals around 
estimates. True intercepts are dependent on the distribution of other 
individual and setting-level risk factors; plotted estimates do not account for 
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the joint distribution of other risk factors with respect to the access of children 
to water and sanitation at the country level and GDP per capita. Sanitation 
conditions include improved (a) and unimproved sanitation (b) or open 
defaecation (c) incombination with or without improved water access. Model 
parameter estimates are presented in fullin Supplementary Table 30. 
Estimates are obtained from analyses of DHS and MICS surveys comprising 
377,665 children across 77 countries. Quantiles are obtained through5,000 
independent draws from the distribution of estimates. 
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Extended Data Fig. 4 | Fitted association of GDP per capita with antibiotic 
treatment of ARI, by region. a, b, We illustrate model-estimated probabilities 
of antibiotic treatment for cases of ARI among children aged 24-59 months in 
rural (a) and urban (b) settings associated with region and GDP per capita. 
Estimates for central and eastern Europe are omitted from plots as wide 
uncertainty intervals would obscure the illustration of associations in other 
settings. Shaded regions indicate 95% confidence intervals around the 
estimates. True intercepts are dependent on the distribution of other 
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individual and setting-level risk factors; plotted estimates do not account for 
the joint distribution of other risk factors with respect to region and GDP per 
capita. Model parameters are included in Supplementary Table 31; estimates 
are nearly identical for children aged 24-59 months and 0-59 months 
(Supplementary Table 32). Estimates are obtained from analyses of DHS and 
MICS surveys comprising 566,508 children across 77 countries. Quantiles are 
obtained through 5,000 independent draws from the distribution of 
estimates. 
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Extended Data Fig. 5 | Fitted association of GDP per capita with antibiotic 
treatment of diarrhoea, by region. a, b, We illustrate model-estimated 
probabilities of antibiotic treatment for cases of diarrhoea among children 
aged 0-23 months in rural (a) and urban (b) settings associated with region and 
GDP per capita. Estimates for central and eastern Europe are omitted from 
plots as wide uncertainty intervals would obscure the illustration of 
associations in other settings. Shaded regions indicate 95% confidence 
intervals around estimates. True intercepts are dependent on the distribution 
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of other individual and setting-level risk factors; plotted estimates do not 
account for the joint distribution of other risk factors with respect to region 
and GDP per capita. Model parameter estimates are providedin 
Supplementary Table 33. Estimates are obtained from analyses of DHS and 
MICS surveys comprising 377,665 children across 77 countries. Quantiles are 
obtained through 5,000 independent draws from the distribution of 
estimates. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


4) A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 
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For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


qo 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Our analysis used publicly-available data sources from the MICS and DHS surveys. MICS data are publicly available at http:// 
mics.unicef.org/. DHS data are available upon request at https://dhsprogram.com/Data/. 


Data analysis Analyses were conducted in R (version 3.5.1). We used the "metafor" package (for meta-analysis); the "survival" package (for fitting of 
conditional logistic regression models); the "Amelia II" package (for multiple imputation); and the "caret" and "gbm" packages (for cross- 
validation analyses). Analysis code is available at https://github.com/joelewnard/global-abx/ 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


Analysis code is available from http://github.com/joelewnard to replicate all parts of the analysis and generation of figures. MICS data are publicly available at 
http://mics.unicef.org/. DHS data are available upon request at https://dhsprogram.com/Data/. Files posted to the author’s github repository above include 
reduced versions of the DHS and MICS datasets necessary to replicate analyses. 
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Behavioural & social sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description We conducted a case-control study of the association of pneumococcal conjugate vaccine and rotavirus vaccine receipt with mother- 
reported acute respiratory infection and diarrhea and related care-seeking among children under five years of age in low- and middle- 
income countries, and estimated the vaccine-preventable burden of antibiotic use associated with these infections. 
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Research sample Children under five years of age residing in low- and middle-income countries, whose mothers responded to Demographic Health Survey 
(DHS) and Multiple Indicator Cluster Survey (MICS) rounds. Children under five years old are of interest as the population at greatest risk 
of antibiotic-treated ARI and diarrhea episodes which may be preventable by pneumococcal conjugate vaccines and rotavirus vaccines. 
The DHS and MICS surveys address household composition and risk factors, health outcomes, and heatlhcare utilization among 
household occupants. The design of the DHS and MICS sampling strategies provides a nationally representative selection of households, 
enabling valid estimation of incidence rates and probabilities of care seeking or antibiotic receipt, given illness; data were available from 
77 countries, in total, for our analyses. The DHS/MICS studies provide the largest sample of individual-level observations on ARI and 
diarrhea outcomes, vaccination status, and risk factors among children in LMICs. 


Sampling strategy DHS and MICS use a probability-based geographic sampling scheme to select clusters within countries (or provinces of countries), and 
households within clusters; as our analyses were of a secondary nature, we could not dictate statistical power by direct enrollment of 
study subjects, and used data available from all children. For the case-control study, data were available on 65,815 children, including 
5342 ARI cases and 9944 diarrhea cases. Based on the number of children available for analysis, we expected sufficient statistical power 
to estimate vaccine direct effects against all-cause disease endpoints under a 1:3 matched case-control design. Our analysis of risk factor 
data to estimate incidence rates included all 944,173 children for whom data were available from DHS/MICS surveys beginning in 2006; 
in this large sample, we likewise expected sufficient statistical power would be available for estimation of the effects of individual risk 
factors on ARI and diarrhea outcomes. 


Data collection We conducted secondary analyses of the DHS and MICS datasets; data collection for the original surveys is conducted using standardized, 
pre-piloted questionnaires and is described elsewhere (see https://dhsprogram.com/ and http://mics.unicef.org/). 


Timing We used data collected in surveys administered from January 1, 2006 to December 31, 2018. For the case-control study, we used data 
collected from January 1, 2015 onward; prior surveys did not collect card-confirmed rotavirus and/or pneumococcal vaccination status. 


Data exclusions For the case-control study, we excluded children from whom the following data were unavailable: outcome (ARI or diarrhea and related 
care-seeking), exposure (vaccination status), and covariates used in the matching procedure (including age, visit timing, country, 
urbanicity, household wealth quintile, and pentavalent vaccine doses received). For other analyses (development of the burden model), 
we limited data to the most recent DHS/MICS survey round undertaken in each country and did not exclude children with missing data; 
missing variables were multiply imputed using the Amelia II package in R. 


Non-participation Our analyses included data from all mothers who consented to participate in DHS/MICS survey rounds; participation in the original 
surveys was voluntary, and respondents were free to decline to participate or to refuse to answer any question. We conducted multiple 
imputation to analyze answers to questions for which answers were not available (because the respondent refused to answer or was 
unsure of the answer). 


Randomization Our study was not randomized. For the case-control analyses, we controlled for the following variables via exact matching: age, visit 
timing, country, urbanicity, household wealth quintile, and pentavalent vaccine doses received. In addition, we validated estimates by 
testing for vaccine effects against negative control conditions using the same analysis framework (PCV effects against diarrhea outcomes, 
and rotavirus vaccine effects against ARI outcomes). Our burden model aimed to estimate incidence rates as a function of multiple 
individual risk factors, for which causal inference was not the focus of analyses. 
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Immune evasion is a major obstacle for cancer treatment. Common mechanisms of 
evasion include impaired antigen presentation caused by mutations or loss of 
heterozygosity of the major histocompatibility complex class I (MHC-I), which has 
been implicated in resistance to immune checkpoint blockade (ICB) therapy’ ®. 
However, in pancreatic ductal adenocarcinoma (PDAC), whichis resistant to most 
therapies including ICB‘*, mutations that cause loss of MHC-lare rarely found’ despite 
the frequent downregulation of MHC-I expression® ®. Here we show that, in PDAC, 
MHC-I molecules are selectively targeted for lysosomal degradation by an 
autophagy-dependent mechanism that involves the autophagy cargo receptor NBRI. 
PDAC cells display reduced expression of MHC-1at the cell surface and instead 
demonstrate predominant localization within autophagosomes and lysosomes. 
Notably, inhibition of autophagy restores surface levels of MHC-I and leads to 
improved antigen presentation, enhanced anti-tumour T cell responses and reduced 
tumour growth in syngeneic host mice. Accordingly, the anti-tumour effects of 
autophagy inhibition are reversed by depleting CD8* T cells or reducing surface 
expression of MHC-I. Inhibition of autophagy, either genetically or pharmacologically 
with chloroquine, synergizes with dual ICB therapy (anti-PD1 and anti-CTLA4 
antibodies), and leads to an enhanced anti-tumour immune response. Our findings 
demonstrate a role for enhanced autophagy or lysosome function in immune evasion 
by selective targeting of MHC-I molecules for degradation, and provide a rationale for 
the combination of autophagy inhibition and dual ICB therapy as a therapeutic 
strategy against PDAC. 


Human PDAC celllines expressed heterogeneous levels of total MHC-I 
protein (Fig. la), and notably, exhibited a punctate cytoplasmic dis- 
tribution of MHC-I that co-localized with lysosomes (Fig. 1b). By con- 
trast, non-transformed human pancreatic ductal epithelial (HPDE) 
cells showed predominant localization of MHC-I on the plasma mem- 
brane (Fig. 1b). Indeed, MHC-I molecules were highly enriched in PDAC 
lysosomes as compared to HPDE lysosomes (Fig. 1c, Extended Data 
Fig. 1a, b). Moreover, lysosomal inhibition resulted in accumulation of 
MHC-I within lysosomes, which confirms that MHC-lis actively routed 
to the lysosome for degradation (Fig. 1d). A substantial fraction of 
the MHC-I puncta also co-localized with LC3B-labelled autophago- 
somes in PDAC cells, consistent with the increased autophagy levels 
in PDAC? " (Fig. le). Notably, similar phenotypes were observed in 


several non-small-cell lung cancer (NSCLC) cell lines (Extended Data 
Fig. 1c, d). Flow cytometry-based analysis of total intracellular versus 
plasma membrane MHC-I confirmed a higher relative abundance of 
intracellular MHC-I in most PDAC cell lines (Fig. 1f). Similarly, surface 
levels of MHC-I were lower in PDAC cells derived from a genetically 
engineered mouse model (GEMM) of PDAC” than in normal pancreas 
cells (Extended Data Fig. le). Furthermore, immunofluorescence 
staining revealed that all human PDAC tumours analysed contained 
considerable regions with intracellular MHC-I localization (Fig. 1g, 
Extended Data Fig. 1f), supporting our in vitro findings. Together, these 
data suggest that MHC-I molecules are reduced at the cell surface and 
predominantly localized within autophagosomes and lysosomes in 
PDAC. Autophagy inhibition by ATG3 and ATG7 knockdown as well as 
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Fig. 1| MHC-Iis enriched in lysosomes of PDAC cells and displays reduced 
cell surface expression. a, Levels of MHC-I (HLA-A, -B, -C) in HPDE and human 
PDAC cell lines. b, Left, localization of MHC-I (green) relative to LAMP1-positive 
(red) lysosomes. Scale bars, 20 pm. Right, the percentage co-localization 
(n=14-20 fields). c, Presence of MHC-linimmuno-isolated lysosomes. 

d, Accumulation of MHC-linimmuno-isolated lysosomes after treatment with 
lysosome inhibitors E64d and pepstatin A (pep) for 6h. e, Left, localization of 
MHC-I (green) relative to LC3B-positive (red) autophagosomes. Scale bars, 

20 um. Right, the percentage co-localization (n=14-20 fields). f, Flow 
cytometry-based analysis of intracellular versus plasma membrane (PM) MHC-I 


lysosomal inhibition with V-ATPase inhibitor bafilomycin A1 (BafA1) 
increased total and plasma membrane MHC-I levels in PDAC cells 
(Fig. 2a, b, Extended Data Fig. 2a—i). Moreover, surface MHC-I levels 
of Atg5 mouse PDAC cells"° were higher than those of Atg5"”* PDAC 
cells (Extended Data Fig. 2j). Notably, lysosomal inhibition with BafA1 
or chloroquine increased levels of MHC-I proteins but did not affect 
those involved in antigen processing and presentation (Extended Data 
Fig. 2k, 1), which suggests that autophagy inhibition would not impair 
these steps. Similar phenotypes were also observed in several NSCLC 
cell lines (Extended Data Fig. 2m-o). 

Previous studies have revealed non-canonical functions for a sub- 
set of autophagy proteins in alternative trafficking pathways such 
as LC3-associated phagocytosis (LAP) and endocytosis (LANDO)"™. 
Knockdown of FIP200, which is required for autophagy but not for 
LAP or LANDO, led to an increase in MHC-I levels (Extended Data 
Fig. 3a—c). Similarly, knockdown of ATG14, ATG13 or ULK1, all required 
for autophagy but dispensable for LAP, increased surface MHC-I levels 
(Extended Data Fig. 3d-j). By contrast, no obvious increase in surface 
MHC-I was observed after knockdown of RUBICON, which is required 
for LAP and LANDO but not for autophagy (Extended Data Fig. 3k, 1). 
Collectively, these data suggest a specific role for macroautophagy in 
the trafficking of MHC-I to the lysosome. 


NBRI1 mediates selective autophagy of MHC-I 


Autophagy can selectively degrade target molecules using autophagy 
cargo receptor proteins that bind to and recruit substrates to 
autophagosomal membranes”. To identify autophagy receptor 
protein(s) involved in MHC-I degradation in PDAC cells, we generated 
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levels. Graph shows higher intracellular MHC-I relative to plasma membrane 
MHC-lin PDAC cells (n=9 replicates pooled from 3 independent experiments 
per cell line). Dataare mean +s.d. g, Top, intracellular localization of MHC-I 
(green) in CK19-positive (red) ducts froma specimen froma patient with PDAC. 
Scale bars, 20 pm. Bottom, the percentage of ducts showing intracellular 
MHC-I localization. A representative of at least two independent experiments 
is shownina, c,d. For box-and-whisker plots (b, e), centre lines indicate median 
values and whiskers represent minimum and maximum values. Pvalues were 
determined by unpaired two-tailed t-tests. NS, not significant. See Source Data 
for exact n values for b, e. See Supplementary Fig. 1 for gel source data. 


a proximity biotinylation assay in which the C terminus of HLA-A was 
fused to the biotin ligase TurboID” and Flag (HLA-A-TrID). After addi- 
tion of biotin, HLA-A-TrID covalently tags endogenous proteins within 
afew nanometres of the ligase with biotin (Extended Data Fig. 4a, b). 
Among the autophagy receptors tested, only NBRI showed signifi- 
cant biotinylation, indicating that NBR1 interacts with MHC-I (Fig. 2c). 
Furthermore, immunofluorescence revealed more frequent 
co-localization between NBR1 and MHC-1 in PDAC cells relative to 
HPDE cells (Fig. 2d). NBR1 has been shown to interact with and tar- 
get ubiquitylated substrates for degradation”. Indeed, MHC-I is 
poly-ubiquitylated in PDAC cells (Fig. 2e, Extended Data Fig. 4c) whereas 
LC3B is not ubiquitylated and EGFR is mono-ubiquitylated, as previ- 
ously described’. Accordingly, NBR1 lacking its ubiquitin-associated 
(UBA) domain was unable to co-localize with MHC-I, despite retaining 
localization with LC3B (Fig. 2f, Extended Data Fig. 4d). Finally, NBR1 
knockdown increased total and plasma membrane MHC-I levels in 
PDAC cells (Fig. 2g, h, Extended Data Fig. 4e, f), confirming a role for 
NBR1in MHC-I regulation. Together, these data demonstrate that sur- 
face MHC-I is decreased in PDAC via an NBR1-mediated autophagy- 
lysosomal pathway. 


Autophagy inhibition enhances anti-tumour immunity 


CD8* cytotoxic T cells have crucial roles in anti-tumour immunity. We 
hypothesized that reduced surface levels of MHC-I on PDAC cells may 
facilitate their evasion from CD8" T cells, which recognize tumour 
antigens presented by MHC-I. To test this, we used mouse PDAC 
cells derived from C57BL/6 mice and engineered them to express a 
doxycycline (Dox)-inducible dominant-negative mutant of ATG4B 
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Fig. 2|NBR1 promotes MHC-I trafficking to the lysosome throughan 
autophagy-dependent pathway. a, Effect of short hairpin RNA 
(shRNA)-mediated ATG3 knockdown on levels of HLA-A, HLA-B and HLA-C in 
human PaTu8902 cells. b, Flow cytometry-based quantification of plasma 
membrane levels of MHC-I (HLA-A, -B, -C) (PaTu8902,n=9, 8, 9; KP4,n=9 per 
group; data pooled from three independent experiments) after 
shRNA-mediated ATG3 knockdown. c, Left, KP4 cells expressing HLA-A-TrID 
were labelled with 10 pM biotin for 30 min. Biotinylation of proteins was 
detected after streptavidin pull down. Asterisk denotes self-biotinylation of 
HLA-A-TrID. IP, immunoprecipitation. Right, enrichment for each receptor 
(n=4 independent experiments). d, Localization of MHC-I (red) relative to 
GFP-NBRI1 (green) in HPDE and PaTu8902 cells. Arrowheads show examples of 
co-localization. Scale bars, 20 um. Graph shows quantification of 
co-localization (HPDE, n= 23 fields; Patu8902, n=20 fields). e, Endogenous 
ubiquitylated proteins were affinity captured from PaTu8902 cells with 


(ATG4B(C74A)), which potently inhibits autophagy”. Dox treat- 
ment efficiently inhibited autophagy and increased surface MHC-I 
levels (Extended Data Fig. 5a, b) but did not affect other cell-surface 
molecules, such as those that are immune inhibitory (Extended Data 
Fig. 5c). These data support a specific mechanism that links surface 
MHC-1 levels and autophagy. 

To evaluate antigen presentation, the model neoantigen ovalbu- 
min (OVA) was expressed in mouse PDAC cells carrying Dox-inducible 
ATG4B(C74A) (Extended Data Fig. 5d). Dox treatment increased surface 
expression of both MHC-Iand OVA-derived peptide SIINFEKL bound to 
H-2K°, confirming enhanced peptide presentation (Fig. 3a, b, Extended 
Data Fig. Se, f). OVA-specific CD8* T cells (OT-I cells) co-cultured with 
PDAC cells pre-treated with Dox showed higher proliferation and expres- 
sion of IFN-y and TNF than those co-cultured with non-pre-treated PDAC 
cells (Fig. 3c, Extended Data Fig. 5g, h). Accordingly, Dox-treated PDAC 
cells showed reduced viability after co-culture with OT-I cells (Fig. 3d), 
indicating enhanced T-cell-mediated tumour cell killing. Notably, these 
effects were MHC-I specific, as a H-2K°-SIINFEKL blocking antibody 
partially inhibited OT-I proliferation and rescued PDAC cell viability 
in the setting of autophagy inhibition (Fig. 3c, d). Collectively, these 


102 | Nature | Vol581 | 7May 2020 


UBQLNI1-UBA-conjugated beads. Arrowheads indicate MHC-I 
polyubiquitylation. Treatment of affinity-captured samples for 1h with 
purified deubiquitylating enzyme Usp2-cc (+) to induce deubiquitylation leads 
toloss of MHC-I polyubiquitylation. Control proteins: LC3B (no ubiquitylation) 
and EGFR (mono-ubiquitylation). f, Left, endogenous MHC-Ico-localizes with 
wild-type (WT) GFP-NBR1 but not GFP-NBR1 lacking its UBA domain (dUBA). 
Right, quantification of co-localization (GFP-NBRI1; n=17 fields; GFP-NBR1 
dUBA; n=14 fields). Scale bars, 20 pmand 10 pm (inset). g, Effect of 
shRNA-mediated NBR1 knockdown on levels of MHC-I. h, Flow 
cytometry-based quantification of plasma membrane MHC-I(n=9 replicates 
from three independent experiments) after NBR1 knockdown. A representative 
of at least two independent experiments is shown ina, e, g.Dataare meants.d. 
(b,c, h). Box-and-whisker plots are as in Fig. 1. Pvalues were determined by 
unpaired two-tailed t-tests. See Supplementary Fig. 1 for gel source data. 


results indicate that tumour-specific inhibition of autophagy leads to 
increased antigen presentation, which enhances CD8*' T cell prolifera- 
tion, activation and tumour cell killing in vitro. 

To test the effect of autophagy inhibition on anti-tumour immune 
responses in vivo, mouse PDAC cells expressing Dox-inducible 
mStrawberry (mSt) or mSt-ATG4B(C74A) (4B) (Extended Data Fig. 6a) 
were orthotopically transplanted into syngeneic (C57BL/6) mice. 
Autophagy-inhibited cells (4B) formed smaller tumours with higher 
MHC-I expression than control cells (mSt) while PD-L1 expression 
was unchanged (Fig. 3e-g, Extended Data Figs. 6b-f, 7k). Moreover, 
autophagy-inhibited tumours (4B) exhibited a significant increase in 
infiltrating CD8* T cells and a decrease in myeloid-derived suppres- 
sor cells, a major immunosuppressive cell type in PDAC tumours”’ 
(Fig. 3h, Extended Data Fig. 6g-j). Similarly, knockdown of ATG7 
resulted in a significant reduction in tumour burden and an increase 
in tumour-infiltrating T cells (Extended Data Fig. 6k—n). Notably, 
there was a significant correlation between smaller tumour sizes 
and increased CD8*' T cell infiltration (Extended Data Fig. 60), sup- 
porting a role of T cell immunity in control of autophagy-deficient 
tumours. Similar results were obtained ina liver metastasis model: mice 
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Fig. 3 | Autophagy inhibition enhances anti-tumour T cell response. 

a, b, Surface H-2K? (a) and H-2K°-SIINFEKL (b) measured by flowcytometry. 
Mouse PDAC cells expressing OVA and Dox-inducible mTurquoise2- 
ATG4B(C74A) were grownas organoids and treated with or without Dox (lng 
ml") for 96h (n=4 per group). MFI, mean fluorescent intensity. c,d, Co-culture 
of OT-I cells with HY19636 cells shown in aandb. After 48 h, OT-I proliferation 
was measured by CFSE dilution (n=4 per group) (c) and PDAC cell viability was 
measured by Cell-Titer Glo (n=6 per group) (d). e-r, HY15549 cells carrying 
Dox-inducible mStrawberry (mSt) or mSt-ATG4B(C74A) (4B) were 
orthotopically (e-h, m-r) or intrasplenically (i-I) injected into syngeneic mice 
(C57BL/6). The expression of MHC-I and PD-L1on PDAC cells (f, g,j, k, p) and 
tumour-infiltrating CD8* T cells (h, I, n, q) quantified by flow cytometry. 


injected with autophagy-inhibited cells (4B) exhibited lower meta- 
static burden, higher MHC-I expression on cancer cells, and more 
tumour-infiltrating CD8* T cells than mice injected with control cells 
(mSt) (Fig. 3i-I, Extended Data Fig. 6p-t). Together, these results con- 
firm that autophagy-inhibition restores surface levels of MHC-I on 
cancer cells and enhances anti-tumour T cell response in vivo. 
Notably, antibody-mediated CD8* T cell depletion restored the 
growth of autophagy-inhibited tumours (4B) (Fig. 3m, Extended 
Data Fig. 7a—c), confirming the role of CD8* T cells in tumour control. 
We also find that mice deficient for CD103* dendritic cells (Batf3’)*, 
which have pivotal roles in CD8* T cell priming” and recruitment into 
tumours”°??, showed almost complete loss of tumour-infiltrating 
CD8* T cells (Fig. 3n, Extended Data Fig. 7d-f). Accordingly, growth 
of autophagy-inhibited tumours (4B) was restored in Batf3 mice 
(Fig. 30). These results indicate that the anti-tumour effects of 
tumour-specific autophagy inhibition are mediated, at least in part, 
by CD8*' T cells, and this process requires CD103* dendritic cells. 
Finally, to confirm the effect of increased MHC-I expression after 
autophagy inhibition on tumour growth in vivo, cell surface levels of 
MHC-I were depleted by knockdown of beta-2 microglobulin (B2M), 
a crucial component of the MHC-I complex (Extended Data Fig. 7g). 
B2M knockdown led to MHC-I depletion in vivo, decreased the number 
of CD8*T cells in autophagy-inhibited tumours (4B), and rescued the 
growth of autophagy-inhibited tumours (4B) (Fig. 3p-r, Extended Data 
Fig. 7h-k). We also found that tumour-infiltrating CD103° DCs, which 
are increased in autophagy-inhibited tumours (4B), were decreased 


e-h, Orthotopic tumours obtained on day 20 (mSt, n=8; 4B, n=7).i-l, Livers 
obtained on day 15 (n=4 per group). e, i, Weight of tumours (e) and livers (i). 
f-h,j-l, Flowcytometry analysis. m, Weight of tumours after treatment with 
isotype control IgG or neutralizing monoclonal antibody against CD8 (n=7 per 
group).n, 0, Tumours in wild-type or Batf3“ knockout (KO) mice (n=8, 4,8 and 
5; left to right).n, Quantification of CD8* T cells. o, Tumour weight. 

p-r, Tumours expressing control shRNA (Scr) or shRNA against B2m (n= 8 per 
group) collected on day 20. p,q, Flowcytometry analysis. r, Tumour weight. 
Data are mean +s.d. (a—d) ors.e.m. (e-r). nindicates biological replicates (a-d) 
or individual mice (e-r). For a-m, p-r, experiments were performed at least 
twice and representative data of one experiment are shown. Pvalues were 
determined by unpaired two-tailed t-tests. 


after B2M knockdown (Extended Data Fig. 7f, j). This is in line with a 
recent study showing that MHC-I-restricted antigen-recognition by 
CD8* T cells can trigger activation of dendritic cells, which further aug- 
ments recruitment of CD8* T cells into the tumour”. Overall, these data 
indicate that increased surface expression of MHC-I on PDAC cells after 
tumour-specific inhibition of autophagy is a prerequisite for increased 
CD8*T cell infiltration and tumour cell killing. 


Basal autophagy determines immunogenicity 


Our data led us to hypothesize that basal autophagy flux might deter- 
mine the immunogenicity of PDAC cells. Mouse PDAC cells grown as 
organoids” exhibited considerable heterogeneity in autophagy flux 
(measured as the ratio of green to red fluorescent protein signal, GFP/ 
RFP), which closely resembled that of orthotopic tumours (Extended 
Data Fig. 8a, b). Taking advantage of this heterogeneity, PDAC cells 
with the lowest and highest 20% of GFP/RFP ratio were isolated as the 
autophagy-high (AThi) and autophagy-low (ATIo) cells (Extended Data 
Fig. 8c). Transcriptome analysis confirmed upregulation of autophagy 
or lysosome related genes and the MiT/TFE transcription factors, mas- 
ter regulators of autophagy or lysosome gene expression”, in the AThi 
population (Extended Data Fig. 8d, e). Notably, AThi and ATlo cells 
derived from a clonal population showed a similar transcriptional 
profile (Extended Data Fig. 8f), suggesting that diversities in basal 
autophagy flux arise not from genetic variations but instead from het- 
erogeneous expression of the autophagy or lysosome gene program”. 
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Consistent with the cell-autonomous roles of autophagy in PDAC””, 
ATlo cells exhibited reduced clonogenic capacity in vitro (Extended 
Data Fig. 8g, h), which is a similar result to the decreased clonogenic 
growth of ATG4B(C74A)-expressing cells (Extended Data Fig. 8i). After 
orthotopic transplantation into syngeneic mouse hosts, ATlo cells 
gave rise to smaller tumours than AThi cells (Extended Data Fig. 9a). 
Notably, this was reproduced using cells sorted from a clonal popula- 
tion (Extended Data Fig. 9b), which suggests that tumour-cell intrinsic 
factors account for the observed phenotypes rather than differences 
in retroviral-vector integration sites or copy numbers of the reporter. 
Furthermore, ATlo-derived tumours exhibited higher MHC-I expression 
and more tumour-infiltrating CD8" T cells, which inversely correlated 
with tumour weight, compared to AThi-derived tumours (Extended 
Data Fig. 9c-f). This growth advantage of AThi-derived tumours was 
lost in nude mice (Extended Data Fig. 9g), confirming a role of T cells 
in the suppression of ATlo-derived tumours. In the liver metastasis 
model, mice injected with ATlo cells exhibited lower metastatic burden 
and higher MHC-I expression on PDAC cells than mice injected with 
AThicells (Extended Data Fig. 9h-j). CD8" T cells tended to accumulate 
around PDAC cells with low-autophagy flux, whereas CD8' T cells were 
scarce around PDAC cells with high-autophagy flux (Extended Data 
Fig. 9k). These results, along with the data in autophagy inhibition 
models, indicate that autophagy is a crucial determinant of immuno- 
genicity in PDAC cells. 


Autophagy inhibition sensitizes PDAC to dual ICB 


PDAC is refractory to ICB*. To test whether inhibition of autophagy 
might sensitize PDAC to ICB, we treated established syngeneic ortho- 
topic tumours with anti-PD-1 antibody alone or dual ICB (anti-PD-1 
and anti-CTLA4 antibodies). Consistent with a recent clinical trial‘, 
control tumours (mSt) did not respond to either treatment. By contrast, 
autophagy-inhibited tumours (4B) responded significantly to dual 
ICB (Fig. 4a—c), but not to anti-PD-1 antibody alone (Extended Data 
Fig. 10a—c). These tumours showed increased infiltration with bulk CD8* 
Tcells and PD1°TIM3 cells (Fig. 4d, Extended Data Fig. 10d—-g), which 
retain greater function than the most severely exhausted or dysfunc- 
tional PD1‘TIM3* cells”®. These results indicate that tumour-specific 
autophagy inhibition sensitizes PDAC tumours to dual ICB. 

Finally, we assessed the translatability of our findings to systemic 
autophagy inhibition using chloroquine—a clinically available anti- 
malarial agent that inhibits acidification of the lysosome and has been 
used to inhibit autophagy in patients” ”’. Treatment with lysosomal 
inhibitors chloroquine or BafA1 increased surface levels of MHC-I in 
mouse PDAC cells in vitro (Extended Data Fig. 10h). Chloroquine treat- 
ment also increased surface expression of MHC-I, but not PD-L1, in 
orthotopic tumours (Extended Data Fig. 10i-k). However, chloroquine 
monotherapy failed to significantly reduce tumour weight or increase 
the infiltration of T cells (Extended Data Fig. 101, m), possibly due to 
the unfavourable pharmacokinetics of this drug in vivo’. Notably, the 
combination of chloroquine and dual ICB exerted potent anti-tumour 
activity (Fig. 4e-i) and a reduction in autophagy flux was confirmed in 
chloroquine-treated tumours (Fig. 4j, Extended Data Fig. 10n). Moreo- 
ver, tumours treated with chloroquine plus ICB exhibited increased 
infiltrations of CD8* T cells (Fig. 4k) and an increase in the number of 
functional PD1*TIM3 CD8* T cells (Extended Data Fig. 100). Overall, 
these data indicate that autophagy inhibition, either in cancer cells 
alone or systemically, sensitizes PDAC tumours to dual ICB therapy 
(Fig. 41). 


Discussion 


Our results suggest that autophagy is a crucial regulator of immuno- 
genicity in PDAC cells. This is in line with a recent study that showed that 
the lysosomal pathway is strongly correlated with reduced infiltration 
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Fig. 4| Autophagy inhibition sensitizes PDAC to dual ICB. a-d, Mice bearing 
orthotopic tumours (HY15549) expressing Dox-inducible mSt or 4B received 
isotype control IgG or dual ICB (anti-PD1/CTLA4 monoclonal antibodies; mAb) 
(n=7 per group).a, Study design. b, c, Images (b) and weight (c) of tumours. 

d, Quantification of tumour-infiltrating CD8* T cells by flow cytometry. 

e-k, Mice bearing orthotopic tumours (HY15549) expressing the GFP-LC3- 
RFP reporter received chloroquine (CQ) and ICB (n=8 per group). e, Study 
design. q24h, every 24 hours. f, g, Images (f) and weight (g) of tumours. No 
macroscopic tumour was identified in three mice receiving chloroquine plus 
dual ICB (#3, 5 and 6).h, Response rates from two independent experiments. 
Responseis defined as more than 80% reduction in tumour weight as compared 
with control tumours (PBS + IgG). N/A, not applicable. i, Representative 
haematoxylin and eosin (H&E) images of the pancreas undergoing tumour 
regression (#6). White dashed line indicates tumour remnants. Scale bars, 250 
and 100 pm (inset).j, Autophagy flux represented by GFP/RFP ratio per 20 x 
field (n=49, 39, 47 and 54; left to right). Increased GFP/RFP ratio indicates 
reduced autophagy flux. k, Quantification of tumour-infiltrating CD8* T cells 
by flowcytometry (n=8, 8,8 and 5; left to right). I, In PDAC cells, surface MHC-I 
is downregulated by active degradation through the autophagy-lysosome 
system, contributing to the primary resistance toICB. NBR1 binds to MHC-I, 
facilitating its trafficking to autophagosomes (left). Inhibition of autophagy or 
the lysosome restores surface MHC-I expression, leading to enhanced 
anti-tumour T cell immunity and improved response to ICB (right). Data are 
mean +s.e.m. (c,d, g,k) ors.d. (j). nindicates individual mice (c,d, g, k) or 
individual 20~ fields (j). Allexperiments were performed twice and 
representative data of one experiment are shown. P values were determined by 
unpaired two-tailed ¢-tests. 


of CD8*T cells inhuman PDAC*’. In addition, autophagy-related genes 
are enriched in MHC-I negative PDAC cells that reside in liver metas- 
tasis®, also suggesting a role for autophagy as a negative regulator 
of MHC-I. Given the critical roles for the autophagy and lysosome 


system in supporting PDAC metabolism and growth, our data on 
immune evasion add to the growing list of cell-autonomous and 
non-cell-autonomous functions of the autophagy and lysosome sys- 
tem in PDAC pathogenesis!?”. 

We found that systemic autophagy inhibition by chloroquine, as 
well as tumour-specific autophagy inhibition, sensitizes PDAC to dual 
ICB. In addition, recent evidence accounts for the improved therapeu- 
tic effects of systemic autophagy inhibition. First, host autophagy 
supports tumour growth by providing nutrients™ °°. Second, loss of 
autophagy proteins or chloroquine treatment increases surface levels 
of MHC-1in dendritic cells, leading to enhanced CD8’ T cell response 
in models of virus infection. Also, autophagy inhibition directly 
enhances anti-tumour activity of CD8* T cells®. Finally, loss of LAP, a 
process also inhibited by chloroquine, polarizes tumour-associated 
macrophages into atumour-suppressive phenotype (M1), promoting 
anti-tumour T cell responses”. In line with these, clinical trials have 
shown that hydroxychloroquine, a derivative of chloroquine, has activ- 
ity in patients with PDAC?®”’. Whether the addition of ICB would be 
synergistic remains to be determined. 

In this study, we focused on CD8*° T cells, given their direct interac- 
tion with MHC-I on cancer cells. However, we also observed changes 
in other immune cells suchas myeloid-derived suppressor cells, CD4* 
T cells and CD103* dendritic cells after inhibition of autophagy. Inves- 
tigating how these and changes in other immune cell types are medi- 
ated, such as the potential involvement of secreted proteins from 
PDAC cells after autophagy inhibition, will be important subjects of 
future work. 

Despite the evidence mentioned above and previous work dem- 
onstrating that inhibition of autophagy does not impair anti-tumour 
adaptive immunity”, autophagy inhibition has also been reported to 
effect some aspects of the immune system such as memory forma- 
tion in virus-specific CD8* T cells and CD4* T cell priming by dendritic 
cells?”?° and may impair chemotherapy-induced immunogenic cell 
death”. Therefore, further studies will be needed to define more sub- 
tle aspects of the immune response after autophagy inhibition and 
howto best combine autophagy or lysosome blockade with cytotoxic 
and immune-based therapies. For example, deeper insights into the 
potential effect of basal immunogenicity” on response to autophagy 
inhibition, the differences in mutational burdenin mouse and human 
tumours”, the effect of potential dominant antigens in experimental 
systems“, and the heterogeneity described here in terms of MHC-I 
expression in patients will probably contribute to the successful clini- 
cal translation of our findings. Importantly, the mechanistic insights 
described here about how autophagy can promote immune evasion, 
provide strong rationale to pursue these studies with the ultimate goal 
of developing new therapeutic approaches for patients with PDAC. 
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Methods 


Cell culture 

The cell lines PaTu-8988T, KP4, MiaPaca2, Panc 2.03, PaTu-8902, Pancl, 
AsPcl, HupT3 and A549 were obtained from the American Type Culture 
Collection (ATCC) or the DSMZ. H441, H358, HCT116 and BEAS-2B were 
provided by E. Collisson. HPDE was provided by M. Tsao”. Cells were 
cultured in the following media: PaTu-8988T, KP4, MiaPaca2, PaTu- 
8902, Pancl, AsPcl and BEAS-2B in DMEM supplemented with 10% FBS; 
Panc 2.03 and HupT3 in RPMI with 10% FBS; HPDE cells were cultured 
in keratinocyte serum-free (KSF) medium supplemented by epidermal 
growth factor and bovine pituitary extract (Life Technologies, Inc.), 
supplemented with 1% penicillin/streptomycin (Gibco). Cell lines were 
regularly tested and verified to be mycoplasma negative using Myco- 
Alert Detection Kit (Lonza) or via PCR. 

Primary mouse PDAC cell lines were established from pancreatic 
tumours in respective GEMMsas described previously”. HY15549 and 
HY19636 cells were established from female KPC mice (p48-cre* (p48 
is also knownas Ptfla) Kras‘“°2"", Trp53°")® that were fully back- 
crossed into a C57BL/6 background. The other cell lines were derived 
from the following mice’: Atg5“* KPC cells, Pdx1-cre*, Kras‘* ©)", 
Trp53°"*, Atg5‘* mice; Atg5" KPC cells, Pdx1-cre*, Kras“©?"", Trp53°*, 
Atg5!®/"* mice; AY6284 cells, p48-cre*, Kras‘§©25", Trp53*, Rosa", 
mSt-ATG4B(C74A) mice”. All mouse PDAC cells were maintained in 
DMEM (Corning) supplemented with 10% FBS (Atlanta Biologicals, 
$11550H) and 1% penicillin/streptomycin (Gibco). 

Cells were grown in a humidified incubator with 5% CO, at 37 °C. 
Cultures were routinely verified to be negative for mycoplasma. Cell 
lines were authenticated by fingerprinting, and low passage cultures 
were carefully maintained in a central lab cell bank. 


Constructs 

GFP-NBR1 and GFP-NBRI1 dUBA were provided byJ. Debnath. LysoTag 
TMEM192-mRFP-3xHA (TMRHA) was generated by subcloning the 
cDNA of TMEM192 (Origene) together with monomeric red fluorescent 
protein (mRFP) and 3x haemagglutinin (HA) tag into the Nhel and EcoRI 
sites of pLJM1 lentiviral vector. HLA-A-TurboID-Flag (HLA-A-TrID) was 
generated by subcloning the cDNA of HLA-A (Addgene plasmid, 85162) 
into the EcoRI and Notl sites of the TurboID pLVX vector (gift from R. 
Zoncu). pMXs GFP-LC3-RFP was a gift from N. Mizushima (Addgene, 
plasmid 117413). For Dox-inducible expression of ATG4B(C74A), mTur- 
quoise2 or mStrawberry was fused to ATG4B(C74A) and inserted into 
either pSLIK-Hygro (used for in vitro studies) (Addgene, plasmid 25737) 
or pINDUCER20 (used for in vivo studies) (Addgene, plasmid 44012), 
using the Gateway Cloning system (Thermo Fisher Science). For the 
generation of OVA-expressing cells, the COVA fragment was cloned from 
pCl-neo-cOVA (Addgene, plasmid 25097), fused with 2A peptide and 
mStrawberry sequences using NEBuilder HiFi DNA Assembly Cloning 
Kit (New England BioLabs) according to manufacturer’s instruction, and 
inserted into the EcoRI and Sall sites of pBabe-zeo (Addgene, plasmid 
1766) to generate pBabe-cOVA-2A-mStrawberry. Stable cOVA expres- 
sion was confirmed and monitored with mStrawberry fluorescence. 


shRNAs 
shRNA vectors (pLKO.1 puro) were obtained from the Sigma MISSION 
TRC shRNA library. The sequences and RNAi Consortium clone IDs for 
the shRNAs used are as follows: 

shATG3#1 (human): 5’-GATGTGACCATTGACCATATT-3’ (TRCNOO001 
48120); shATG3#2 (human): 5’-GCTGTCATTCCAACAATAGAA-3’ 
(TRCNO000147381); shATG7#1 (human): 5’-CCCAGCTAT TGGAACAC 
TGTA-3’ (TRCNO000007587); shATG7#2 (human): 5’-GCCTGCTGAG 
GAGCTCTCCAT-3’(TRCNO000007584);shNRB1#1(human):5’-GCTTCAT 
AGTTATTTGGCATT-3’ (TRCNO000123159); shNBR1#2 (human): 
5’-GCAGCATTTGTGGATGAGAAT-3’ (TRCN0O000123160); shNBR1#3 
(human): 5S°GCCAGGAACCAAGTTTATCAA-3’ (TRCNO000123161); 


shFIP200#1 (human): 5’-GCACTCTTTAACACATTCTTT-3’ (TRCNOOOO 
013523); shFIP200#2 (human): 5’- GCTGTGAATGAGTT TGTAATA-3’ 
(TRCNO000013524); shATG14#1 (human): 5’-CCATAGAACT TGGTCA 
TGTTT-3’ (TRCNO000144080); shATG14#2 (human): 5’-GATCAAT TAC 
AACCACTGCAT-3’ (TRCNO000145367); shAtg3#1 (mouse): 5’-CATATCA 
CAACACAGGTATTA-3’ (TRCNO000247440); shAtg3#2 (mouse): 
5’- GIACATCACT TACGACAAATA-3’ (TRCNO000247442); shAtg7#1 
(mouse): 5’-TTCTGTCACGGT TCGATAATG-3’ (TRCNO000305991); 
shAtg7#2 (mouse): 5’- GCCAACATCCCTGGATACAAG-3’ (TRCNOO0O 
375444); shAtg7#3 (mouse): 5’- TCTTACCCTGCTCCATCAAGA-3’ 
(TRCNO000375421); shB2m (mouse): 5’-CCAGT T TCTAATATGCTATAC-3’ 
(TRCNO000295705); shAtg13#1 (mouse): 5’-TGAAGTCTCTTCTCGCT 
ATTA-3’ (TRCNO000277121); shAtg13#2 (mouse): 5’-GACATACCTTTCG 
CCATGTTT-3’ (TRCNO000176029); shUIk1#1 (mouse): 5’-CGCTTCTTT 
CTGGACAAACAA-3’ (TRCNO000319764), shUIk1#2 (mouse): 5’-CGCTT 
CTTTCTGGACAAACAA-3’ (TRCNO000028768); shGFP: 5’-TGCCCGACA 
ACCACTACCTGA-3’ (TRCNO000072186). sShGFP and shScr (Addgene, 
plasmid #17920) were used as controls. 


siRNAs 

The short interfering RNAs (siRNAs) usedin this study are: siNC (silencer 
negative control #1 siRNA, Thermo, 13778030), siRubicon #1 (s104762, 
Ambion), and siRubicon #2 (s104763, Ambion). Cells were transfected 
with siRNAs using Lipofectamine RNAi Max Transfection Reagent (Life 
Technologies). 


Retroviral and lentiviral transduction 

For the transfection of pMXs GFP-LC3-RFP“* and pBabe-cOVA- 
2A-mStrawberry, retrovirus was produced by co-transfection of 
HEK293FT cells with a retroviral vector and the packaging plasmids 
pHit60 and VSVG at a 0.5:0.25:0.25 ratio. For the transfection of lenti- 
viral vectors (pSLIK-hygro, pINDUCER20 and pLKO.1-puro), lentivirus 
was produced by co-transfection of HEK293FT cells with a lentiviral 
vector and the packaging plasmids psPAX2 (Addgene, plasmid 12260) 
and pMD2.G (Addgene, plasmid 12259) at a 0.5:0.25:0.25 ratio. Trans- 
fection was performed using Lipofectamine 3000 Reagent (Thermo 
Fisher Scientific) according to manufacturer’s instructions. The viral 
supernatant was collected 48 h after transfection, filtered through 
a 0.45-um filter, and used for infection together with Polybrene rea- 
gent (EMD Millipore). Cells expressing Dox-inducible constructs were 
sorted for positive fluorescent expression after Dox treatment to select 
inducible cells, and then sorted for no fluorescent expression after 
Dox withdrawal to remove cells with leaking expression, as described 
previously’. 


Immunofluorescence 

Human cell lines were cultured for two days on coverslips coated with 
fibronectin. After two PBS washes, cells were fixed and permeabilized 
with paraformaldehyde for 15 min at room temperature or ice-cold 
methanol for 5 min at —20 °C. Paraformaldehyde-fixed cells were per- 
meabilized with 0.1% saponin. Samples were then blocked with 5% 
normal goat serum for 15 min at room temperature before incubation 
with primary antibodies (Supplementary Table 1) overnight at 4 °C. 
After washing three times with PBS, cells were incubated in secondary 
antibody at room temperature for 20 min. Slides were mounted on 
glass slides using DAPI Fluoromount-G (0100-20, SouthernBiotech) 
and imaged on a Zeiss Laser Scanning Microsope (LSM) 710 using a 
63x objective. Image processing and quantification were performed 
using Image]. 


Lysosome immunoprecipitation 

For lysosome immunoprecipitation experiments, HPDE and human 
PDAC cell lines stably expressing TMRHA were ruptured and intact 
lysosomes from 1-2 mg of total protein per sample was immunopre- 
cipitated using HA-conjugated Dynabeads as previously described*“*. 


Poly-ubiquilin UBA affinity capture 

Cells were washed twice with ice-cold PBS and lysed in lysis buffer 
(SO0mM Tris-HCI pH/7.5,150 mM NaCl, 1% NP-40,10 mMN-ethylmaleimide) 
supplemented fresh with protease inhibitor cocktail. Lysates were incu- 
bated at 4 °C for 15 min and clarified by centrifugation at 20,000 xg at 
4 °C for 15 min. Samples were quantified by BCA Protein Assay Kit and 
diluted to 1 mg mI“ with dilution buffer (50 mM Tris-HCl, pH 7.5, 150 
mM NaCl, 5 mM EDTA, protease inhibitor). Approximately 1-1.5 mg 
of protein lysates were incubated with 50 pl of Ubiquilin 1 Tandem UBA 
Agarose (BostonBiochem AM-130) overnight at 4 °C. Samples were 
then washed three times in High Salt Wash Buffer (SO mM Tris-HCl pH 
7.5,250 mM NaCl, 0.5% NP-40) and once with 10 mM Tris-HCl, pH 7.5. 
Samples were eluted by adding Laemlli buffer and incubating at 65 °C 
for 15-20 min. 

For DUB digestions, affinity captured material was washed once with 
DUB digestion buffer (50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 20 mM 
dithiothreitol). Liquid was removed and beads were resuspended in 20 
ml of DUB digestion buffer and 5 mg of USP2 catalytic domain (Usp2cc; 
BostonBiochem E-506) for 1h with gentle shaking at 30 °C. Beads were 
washed once in high-salt wash buffer and once with 10 mM Tris-HCl, 
pH 7.5, before eluting affinity captured material by adding Laemmli 
buffer and incubating at 65 °C for 15 min. 


Proximity biotinylation 

Human PDAC cells stably expressing HLA-A-TrID were cultured in 
DMEM supplemented with 10% dialysed FBS (DMEM + dFBS) for 48 h. 
Cells were incubated with 10 pM biotin (Sigma) and incubated at 37 °C 
for 30 min. Media was replaced with DMEM + dFBS and incubated at 
37 °C for a further 2-3 h. For negative controls, we omitted exogenous 
biotin. Cells were then washed twice in ice-cold PBS and lysed in ice-cold 
lysis buffer (1% Triton X-100, 130 mM NaCl, 2.5 mM MgCl,,2 mM EGTA, 
25mMHEPES pH 7.4, supplemented fresh with protease inhibitor cock- 
tail) for 30 min onice. Samples were clarified by centrifugation at 13,300 
rpm for 10 min at 4 °C. Protein content was measured using BCA Protein 
Assay Kit (Life Technologies 23227). Then, 1-2 mg of protein lysates 
were incubated with 50 pl of Dynabeads MyOne Streptavidin C1 (Life 
Technologies) overnight. The beads were washed twice in wash buffer 
1(2% SDS in dH20), once in wash buffer 2 (0.1% deoxycholate, 1% Triton 
X-100, 500 mM NaCl, 1mM EDTA, and 50 mM HEPES, pH 7.5), once in 
wash buffer 3 (250 mM LiCl, 0.5% NP-40, 0.5% deoxycholate, 1mMEDTA, 
and 10 mM Tris, pH 8.1), and twice in wash buffer 4 (SO mM Tris, pH 7.4, 
and 50 mM NaCl). Washes were performed at room temperature for 
5 min with gentle agitation. Samples were eluted in Laemlli buffer and 
boiled at 95 °C. Quantification of biotinylation activity was measured 
as intensity of each lane in the +biotin immune-precipitation condition 
divided by the corresponding intensity of the ligase expression band 
in the input. The +biotin ratios were then normalized to the —biotin 
control ratio and to background. 


Organoid culture 

Normal pancreatic organoids from mice were established as previously 
described”>*’ with slight modifications. In brief, pancreas was har- 
vested from female C57BL/6 mice (4-10 weeks of age), mechanically 
minced withscissors, and digested withl mg ml ‘collagenase P (Sigma), 
4 mg mI Dispase II (Sigma), 10 mM HEPES (Thermo Fisher Scien- 
tific), 1% FBS, 1 mg mI™ Trypsin Inhibitor from Soybean (Sigma) 
in DMEM for 20 min at 37 °C. Digested tissues were embed- 
ded in Matrigel (Corning, 356231) and cultured in Advanced 
DMEM (Thermo Fisher Scientific) supplemented with 200 mM 
L-glutamine (Thermo Fisher Scientific), 10 mM HEPES (Thermo 
Fisher Scientific), 1% Antibiotic-Antimycotic (Thermo Fisher 
Scientific), 500 ng ml! recombinant murine rspondin-1 (Pep- 
rotech), 50 ng ml recombinant murine EGF (Peprotech), 100 
ng ml recombinant murine noggin (Peprotech), 1 uM jagged-1 


(188-204) (AnaSpec, AS-61298), 100 pg ml‘ trypsin inhibitor from 
soybean (Sigma), and 10 iM Y-27632 (Enzo). Two days before analysis, 
culture media was replaced with DMEM supplemented with 10% FBS. 
PDAC organoids were generated by embedding mouse PDAC cell lines 
in Matrigel and were grown in DMEM with 10% FBS. For the dissocia- 
tion of organoids for further assays, organoids were digested with 
TrypLE (Thermo Fisher, A1217701) solution at a 2x concentration for 
20 min at 37 °C, followed by filtration through a 40-ym nylon strainer. 


Quantitative proteomics 

Quantitative mass spectrometry-based proteomics was performed 
as previously described**”’ based on the SL-TMT workflow™. In brief, 
cells were lysed in a lysis buffer (200 mM HEPES pH 8.5, 8 M urea, 1x 
Complete Protease Inhibitor Cocktail (Roche), 1 x PhosStop (Roche)) 
and homogenized by passing through a 2-1-gauge needle. Lysates were 
collected by centrifuging at 20,000g for 5 min at 4 °C, followed by 
disulfide bond reduction with 5 mM dithiothreitol at 37 °C for 25 min 
and alkylation with 1OmM iodoacetamide at room temperature for 
30 minin the dark. Chloroform-methanol precipitation of protein was 
performed, followed by protease digestion in HEPES buffer (200 mM, 
pH8.5). Each sample containing 100 1g protein was digested at a1:100 
protease-to-protein ratio with LysC protease at room temperature 
overnight, followed by digestion with trypsin at 37 °C for 6h. Approxi- 
mately 50 pg of peptides from each sample was labelled with 100 pg 
TMT reagent which were dissolved in anhydrous acetonitrile to achieve 
a final concentration of 30% (v/v). TMT-labelled samples was acidi- 
fied, vacuum centrifuged to near dryness and subjected to C18 SPE 
(Sep-Pak, Waters). Samples were subjected to basic pH reversed-phase 
HPLC. Data were obtained with Orbitrap Fusion mass spectrometer 
(Thermo Fisher Scientific) coupled with a Proxeon EASY-nLC 1000 LC 
pump (Thermo Fisher Scientific). Peptide separation was done using 
acustom Accucore C18 resin (2.6 tm, 100 A, Thermo Fisher Scientific) 
column for 3 husinga gradient of 6-30% acetonitrile in 0.125% formic 
acid witha flowrate of 300 nl min“. All analysis involved an MS-based 
TMT method as previously mentioned™ and mass spectra were 
processed as described earlier*’. 


Tumour cell and OT-I cell co-culture experiment 

Mouse PDAC cells stably expressing OVA and carrying Dox-inducible 
mTurquoise2-ATG4B(C74A) were grown as organoids, treated with or 
without Dox (1 pg mI) for 96 h. Organoids were dissociated into sin- 
gle cells, which were then incubated with either anti-H-2K°-SIINFEKL 
antibody (clone 25-D1.16, BioXCell, BEO207) or isotype control (clone 
MOPC-21, BioXCell, BEOO83) at 100 pg mI for 30 min at 4 °C. Total 
splenocytes were harvested from OT-I mice and CD8*° T cells were 
enriched using Dynabeads Untouched Mouse CD8 Cells (Invitro- 
gen, 11417D) following manufacturer’s instructions. Isolated CD8* 
T cells were labelled with 10 pM CFSE (BioLegend) for 10 min at room 
temperature in the dark, washed three times with RPMI-1640 supple- 
mented with 10% FBS. Ten thousand PDAC cells and forty thousand 
CD8* cells were seeded in 96-well plates and cultured in 100 pl 50% 
DMEM and 50% RPMI-1640 supplemented with 10% FBS, 10 ng mI? 
recombinant murine IL-2 (Peprotech), 27.5 tM 2-mercaptoehanol 
(Gibco), and 100 pg mI’ of respective antibodies. After 48 h, CD8* 
T cells were harvested and stained with anti-CD8a antibody (AF647, 
clone 53-6.7, BioLegend) and DAPI, and proliferation was analysed 
by CFSE dilution using flow cytometry. After removal of CD8* T cells, 
the viability of remaining PDAC cells was measured by CellTiter-Glo 
(Promega). 


Clonogenic assay 

For 2D clonogenic assays’, cells were plated in 6-well plates at 300 
cells per well in 2 ml DMEM supplemented with 10% FBS. After 7 days, 
colonies were fixed with 80% methanol, stained with 0.2% crystal violet, 
and counted. For 3D clonogenic assay, single cells were sorted directly 
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into 384-well round bottom Ultra-Low Attachment plates (Corning 
3830) and grownin DMEM supplemented with 10% FBS and 2% Matrigel 
(Corning 356231). After 10 days, the number of wells with spheroids 
was counted. 


Mice 

Female 8-10-week-old C57BL/6 mice or NCr nude (B6NTac, Taconic) 
were used for allograft experiments. OT-I transgenic mice (003831) 
and Batf37" mice (013755) were purchased from Jackson Laboratory. 
All mice were bred and maintained in the animal facility of the New York 
University School of Medicine. All animal procedures were approved 
by the New York University School of Medicine Institutional Animal 
Care and Use Committee (IACUC) under protocol numbers 1A16-00507 
and IA16-01331. 


Mouse experiments 

Orthotopic and intrasplenic injections of PDAC cells were performed 
as described previously”. In brief, mice were anaesthetized by an 
intraperitoneal injection of ketamine and xylazine. A small incision 
was made on the upper left quadrant of the abdomen, and either the 
pancreas or the spleen was externalized. For orthotopic injection, 
cells were suspended in 20 pl of Matrigel (Corning 356231):HBSS (1:1) 
solution and injected into the pancreatic tail with insulin syringes 
(29-gauge needle, BD 324702). Approximately 1 x 10* HY15549 cells 
or 2 x 10* HY19636 cells were injected unless otherwise indicated. 
For intrasplenic injection, 1 x 10° cells were suspended in 100 pl HBSS 
and then drawn into an insulin syringe (28-gauge needle, BD 329461), 
which was pre-loaded with 200 ul HBSS. The externalized spleen was 
divided by ligating clips (Teleflex, 002200), and cells were injected 
into the hemispleen. After injection, splenic vein was ligated with 
ligating clips (Teleflex, 001200) at the hilum of the spleen, and then 
the hemispleen was removed. After the procedures, the peritoneum 
was closed with a 3-0 VICRYL VIOLET suture (Ethicon, J311H), and the 
skin was closed using the BD AutoClip Wound Closing System (BD). For 
some experiments, mice were fed with Dox-containing diet (625 mg 
kg”) for the indicated period. Mice were euthanized at the indicated 
time points and tumours or the liver were collected after trans-cardiac 
perfusion with PBS. 

For CD8*T cell depletion, mice received intraperitoneal injection 
of anti-mouse CD8a antibody (200 pg, clone 53-6.7, BEO004-1) or 
isotype control (200 pg, clone 2A3, BEOO89). For immune check- 
point blockade experiments, mice received intraperitoneal injection 
of anti-mouse PD-1 antibody (200 pg, clone RMP1-14, BEO146) and 
anti-mouse CTLA-4 antibody (200 pg, BEO131), or rat IgG2a isotype 
control (200 pg, clone 2A3, BEO089) and polyclonal Syrian hamster 
IgG (200 pg, BE0087). All antibodies used in in vivo experiments were 
obtained from BioXCell. For chloroquine treatment, mice received 
intraperitoneal injection of chloroquine solution in PBS at 60 mg kg? 
or PBS every day starting day 4 or 5 after tumour cell implantation. 
Mice were randomlly assigned to specific treatment groups at the 
beginning of treatment. 

All experiments were carried out in a clean conventional facility, 
where mice were housed in pre-packaged disposable irradiated cages, 
and fed with irradiated diet and acidified water. Microisolator cages 
were located on ventilated racks. No tumours in the mice exceeded 
IACUC-defined maximum diameters of >2 cm. Sample sizes were 
determined based on our preliminary experiments and no sample 
size calculation was done. Blinding was not performed as the investiga- 
tor needed to know the treatment groups in order to perform study. 
Tumour weights (an objective measurement) were measured only at 
the study endpoints after mice were euthanized and tumours were 
harvested. 


Flow cytometry. For surface and intracellular MHC-I staining of human 
cell lines, cells were stained with Alexa Fluor 488 anti-human HLA-A, 


-B, -C antibody (BioLegend, clone W6/32) at a 1:75 dilution for 45 min 
at 4 °C in the dark and washed with PBS plus 2% FBS and 2 mM EDTA 
(FACS buffer). Cells were then fixed and permeabilized before staining 
with phycoerythrin (PE)-conjugated anti-human HLA-A, -B, -C anti- 
body (BioLegend, clone W6/32) at a 1:75 dilution for 45 min at room 
temperature and washed with FACS buffer. For cell-surface molecule 
staining of mouse cells, single-cell suspensions were prepared as de- 
scribed above. Cells were washed with FCM buffer (HBSS containing 
1% FBS, 1mM EDTA, and 10 mM HEPES) and stained with antibodies 
(Supplementary Table 1) at 4 °C inthe dark for 20 min. Dead cells were 
depleted by DAPI staining. 

For the immunophenotyping of tumours, tissues were mechanically 
minced with scissors, and then digested in DMEM containing 1 mg mI 
collagenase IV (Gibco), 100 pg ml DNase I (Roche), 1% FBS, 10 mM 
HEPES, 2% Antibiotic-Antimycotic (Thermo Fisher Scientific) for 40 
min at 37 °C in the dark with gentle agitation every 10 min. Digested 
tissues were then washed twice in DMEM containing 10% FBS, filtered 
through a40-ym nylon mesh strainer (Corning). Cells were suspended 
in ACK Lysing Buffer (Thermo Fisher Scientific), incubated for 10 min 
at 4 °C inthe dark to remove red blood cells. Cells were washed twice 
in FCM buffer and counted. Cells were stained with Zombie Aqua Fix- 
able Viability Kit (BioLegend) and blocked with anti-mouse CD16/ 
CD32 antibody (Mouse BD Fc Block, clone 2.4G2, BD Bioscience). 
One million cells were incubated with appropriate antibodies (Sup- 
plementary Table 1) diluted in FCM buffer at 4 °C in the dark for 40 
min. Cells were then washed twice with FCM buffer and analysed or 
further fixed in 2% PL solution (PBS containing 0.1 M L-lysine (Sigma) 
and 2% paraformaldehyde (Ted Pella))* at 4 °C in the dark overnight. 
Intracellular staining was performed using Foxp3/Transcription Fac- 
tor Staining Buffer Set (eBioscience). Cells were analysed onaBDLSR 
Fortessa or a BD LSR-II UV and analysed by FlowJo software (FlowJo, 
LLC, v.10.4). Gating strategies were described in Extended Data Fig. 7k 
and Supplementary Table 2. 


Cell sorting 

Mouse PDAC cells (HY15549) expressing the GFP-LC3-RFP reporter 
were grownas organoids for 8 days, collected and dissociated into sin- 
gle cells as mentioned above. Cells were sorted on MoFlo XDP (Beckman 
Coulter) based on the GFP/RFP ratio as described in detail elsewhere. 
The purity of cells after sorting was confirmed by post-sort analyses 
and was usually >90%. 


Histology and immunohistochemistry 

For formalin-fixed paraffin-embedded (FFPE) sections, tissues 
were fixed in 10% buffered formalin, and embedded in paraffin. 
Five-micrometre-thick FFPE sections were used for H&E staining. For 
immunohistochemical staining, FFPE sections were deparaffinized, 
rehydrated and incubated in boiling 10 mM pH 6.0 citrate buffer for 
20 min for antigen retrieval. To visualize the GFP-LC3-RFP reporter 
signal or mStrawberry/mSt-ATG4B(C74A) expression in PDAC cells, 
tissues were fixed in 2% PL solution at 4 °C overnight, incubated 
in PBS containing 30% sucrose, and embedded in Tissu-Tek OCT 
compound (Sakura Finetek) on dry ice*’. For immunohistochemi- 
cal staining, 7-m frozen sections were washed with Tris-buffered 
saline (pH 7.4) containing 0.05% Tween-20 (TBS-T), blocked with 
TBS-T containing 5% goat serum for Ihr at room temperature, and 
stained with primary antibodies (Supplementary Table 1) at 4 °C 
overnight, followed by incubation with fluorescent-conjugated 
secondary antibodies (1:400, Supplementary Table 1) at room tem- 
perature for 1h. Sections were counter stained with 4 pg ml“ Hoechst 
33342 (Thermo Fisher) and mounted in ProLong Diamond Antifade 
Mountant (Thermo Fisher Scientific, P36970). Fluorescent images 
were obtained with Leica DM6 and analysed using LAS X software 
(v.2.0.0.14332.2). Bright light images were obtained with a Leica 
DM2000 bright-field microscope. 


For the measurement of the GFP/RFP signal ratio, frozen sections as 
prepared above were washed with TBS-T three times and coverslips were 
mounted with ProLong Diamond Antifade Mountant. Slides were dried 
at 4 °C overnight. Fluorescence images were obtained from at least 
four random fields per tumour with a 20x objective lens. Greyscale, 
raw-image files (16-bit) were analysed using ImageJ Software to obtain 
mean intensities of GFP and RFP signals. 

Primary human PDAC specimens (Supplementary Table 3) were 
generated under Institutional Review Board (IRB)-approved protocol 
18-25787 at UCSF. Sections were co-stained with antibodies against 
CK-19 (1:300) and HLA-A,B (1:300) (Supplementary Table 1). An average 
of 28 ducts per sample from 9 patient specimens was imaged based on 
expression of CK-19 to ensure analysis of tumour epithelia. The corre- 
sponding MHC-I localization per CK-19 positive ductal structure was 
classified as intracellular if greater than 50% of the staining displayed 
anon-membrane, punctate distribution. 


Quantitative reverse-transcription PCR 

Total RNA was extracted using TRIzol (Thermo Fisher Scientific) and 
PureLink RNA Mini Kit (Thermo Fisher Scientific), and then reversed 
transcribed using Superscript Vilo IV (Thermo Fisher Scientific) with 
oligo-dT primers. Quantitative PCR was performed with SYBR Green 
Supermix (Bio-rad) onthe CFX96 real-time PCR machine (Bio-Rad). The 
quantity of mRNA was calculated using the AC, method and normalized 
by the GAPDH, Actb or 36B4 (also known as RplpO) genes for humans or 
mice, respectively. Sequences for qPCR primers are listed in Supple- 
mentary Table 4. 


Transcriptome analysis 

RNA-seq libraries were prepared using the Illumina TruSeq Stranded 
mRNA library preparation kit according to manufacturer’s instructions 
by the NYU Genome Technology Center. The libraries were pooled 
and sequenced as 50-base, single-end reads on an Illumina HiSeq 
4000 using high output mode (v4 chemistry). The raw fastq reads 
were aligned tomm10 mouse reference genome using STAR aligner™®. 
Fastq Screen was used to check for any contaminations inthe samples 
and Picard RnaSeqMetrics was used to obtain the metrics of all aligned 
RNA-seq reads. featureCounts was used to quantify the gene expres- 
sion levels”. FPKM (fragments per kilobase of transcript per million 
fragments mapped) data were used as input for gene set enrichment 
analysis (GSEA)°*. 


Immunoblotting 

Cells were lysed in cell lysis buffer (Cell Signaling Technology, 
9803) supplemented with protease inhibitor (Thermo Fisher Sci- 
entific, A32953) and phosphatase inhibitor (PhosSTOP, Sigma 
Aldrich, 04906837001). Proteins were separated on 12% or 44-20% 
Mini-PROTEAN TGxX Precast Protein Gels by SDS PAGE electrophoresis 
and transferred to onto PVDF membranes (IPVHOOO10, EMD Millipore) 
or nitrocellulose membranes (GE Healthcare Amersham Protran NC 
Rolls, Fisher, 10600000). Membranes were blocked in 5% non-fat dry 
milk (Blotting-Grade Blocker, BioRad, 1706404) dissolved in TBS-T 
for lhand incubated overnight at 4 °C with primary antibodies (Sup- 
plementary Table 1). Membranes were washed three times with TBS-T 
and incubated with secondary antibodies for 1h at room temperature: 
anti-rabbit IgG, horseradish peroxidase (HRP)-linked (Cell Signaling 
Technology, 7074), anti-mouse IgG, HRP-linked (Cell Signaling Tech- 
nology, 7076). Images were obtained by chemiluminescence (Bio-rad 
1705061) using a ChemiDoc (Bio-Rad). 


Statistical analysis 

Statistical analysis was performed using Graphpad Prism 7.0 (Graph- 
Pad). Results are expressed as mean +s.d. unless otherwise indicated. 
For each box-and-whisker plot, centre line is the median and whiskers 
represent the minimum and maximum values. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


RNA-seq data have been deposited to the Gene Expression Omnibus 
(GEO) data repository with accession number GSE145766. Source Data 
are provided for all experiments. Other data that support the findings 
of this study are available on request from the corresponding author 
upon reasonable request. 
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Extended Data Fig. 1| Heterogeneous distribution of MHC-lin 
KRAS-mutant cancers. a, Immuno-isolation of intact lysosomes from HPDE 
and PDAC cell lines showing absence of non-lysosome markers as indicated. EE, 
early endosome; ER, endoplasmic reticulum; lyso, lysosome; mito, 
mitochondria. b, High-power images showing MHC-I-positive/LAMP1-positive 
(arrowheads), MHC-I-positive/LAMP1-negative (arrows) and MHC-I-negative/ 
LAMP1-positive (asterisk) puncta. Scale bars, 5 um. c, d, Top, localization of 
MHC-I (green) relative to LAMP1-positive (red) lysosomes (BEAS-2B, n=14; 
A549, n=17;H441, n=15; H358, n=13; HCT116, n=13) (c) or LC3B-positive (red) 
autophagosomes (BEAS-2B, n=18; A549, n=17; H441, n=20; H358,n=12; 
HCT116, n=15) (d) inthe indicated cell lines. Bottom, quantification of 
percentage co-localization. Cell lines indicated in red show significantly 
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increased co-localization relative to BEAS-2B cells, and cell lines indicated in 
blue showa modest increase (H358) or no difference (HCT116). Data are 

mean +s.d. (c,d). Scale bars, 20 pm and 10 pm (inset). e, Flowcytometry-based 
analysis of surface MHC-I (H-2) in mouse normal pancreas (C57BI/6) and mouse 
PDAC cells grownas organoids. Top, isotype-subtracted geometric MFI. Each 
dot represents different mice/lines (n=4). Data are mean+s.e.m. Middle, 
representative flow cytometry plots. Bottom, representative images of 
organoids. f, Immunofluorescent staining images froma patient in Fig. 1g 
showing intracellular localization of MHC-I (green) in CK19-positive (red) 
ducts. Scale bar, 20 pm. A representative of at least two independent 
experiments is shown ina and e. Pvalues determined by unpaired two-tailed 
t-tests (c-e). See Supplementary Fig. 1 for gel source data. 
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Extended Data Fig. 2 | See next page for caption. 


Extended Data Fig. 2 | Inhibition of autophagy and lysosomes restores 
MHC-I levels and plasma membrane localization. a, Immunofluorescence 
staining of MHC-l after shaRNA-mediated ATG3 knockdown. Scale bars, 50 pm. 
b, Representative flow cytometry plots for PaTu8902 cells after knockdown of 
ATG3 (related to Fig. 2b) and ATG7 (see also d). Representative plots fromd and 
Fig. 2b are shown. c, Effect of ATG7 knockdown on MHC-I (HLA-A, -B, -C) 
expression in PaTu8902 cells. d, Flow cytometry-based quantification of 
plasma membrane levels of MHC-I (HLA-A, -B, -C) after ATG7 knockdown (n=9 
replicates from three independent experiments). e, Immunofluores-cence 
staining of MHC-I following ATG7 knockdown. Scale bars, 50 pm. f, g, Surface 
MHC-I levels after knockdown of ATG3 (f) or ATG7 (g) in mouse PDAC cells. Left, 
knockdown efficiency was confirmed by immunoblots. Middle, cell surface 
levels of MHC-I (H-2K°/D°) measured by flow cytometry (n=8 replicates from 
two independent experiments). Right, representative flow cytometry plots are 
shown. h, Treatment of KP4 cells with 150 nM BafA1 for the indicated times 
causes an increase in levels of HLA-A, -B. i, Flow cytometry-based 
quantification of plasma membrane MHC-lin the indicated cell lines after 


treatment with BafA1 for 16 h (n=9 replicates from three independent 
experiments). j, Surface MHC-I (H-2) levels measured by flow cytometry. Mouse 
PDAC organoids were established from Atg5“* and Atg5“ KPC cells.n=4 
biological replicates. Data are representative of three independent 
experiments. Right, representative flow cytometry plots. k, Effect of BafAl 
treatment onthe expression levels of antigen presentation machinery. I, 
Quantitative proteomics analysis of Pancl cells that were treated with 
chloroquine (10 1M) for the indicated periods. n=3 biological replicates. m, 
Effect of BafAl treatment on expression levels of MHC-I in the indicated cell 
lines. Cell lines denoted in green show a significant change across all HLA 
isoforms after BafAl treatment. n, 0, Effect of ATG3 knockdownin H441 cells on 
total MHC-I(n) and plasma membrane MHC-I (0) as measured by flow 
cytometry-based quantification (n=9 replicates from three independent 
experiments). A representative of at least two independent experiments is 
shownina-c, e-h,j,k,m,n. Data are meant+s.d. Pvalues were determined by 
unpaired two-tailed t-tests. See Supplementary Fig. 1 for gel source data. 
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Extended Data Fig. 3 | Inhibition of macroautophagy, but not LAP/LANDO, 50 um.b, f,h,j,1, Cell-surface MHC-I levels were measured by flowcytometry 
restores MHC-I levels. Knockdown mediated by shRNA (a-j) or siRNA (k, I) of (b, f,n=9;h,j, n=12;1,n=16). Data are pooled fromat least three independent 


FIP200, ATG14, ATG13 and ULK1, but not RUBICON, increased MHC-1 levels in experiments. Data are meant+s.d.a-f, Patu8902 cells (human). g-I, HY15549 
PDAC cells. a,d,g, i, k, Knockdown efficiency was confirmed byimmunoblot(a, _ cells (mouse). Arepresentative of at least two independent experiments is 
i,k) and qPCR (d, g). Data are mean +s.d. from three biological replicates per shownina,c,e,iandk. Pvalues determined by unpaired two-tailed t-tests. See 


group (d,g).a,e, Whole-cell abundance of MHC-I wasassessed byimmunoblot. Supplementary Fig.1for gel source data. 
c, Immunofluorescence staining of MHC-I (green) and LAMP2 (red). Scale bars, 
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Extended Data Fig. 5 | Autophagy inhibition restores MHC-I expression, 
leading to enhanced anti-tumour T cell response in vitro. a, b, Autophagy 
flux (a) and cell-surface MHC-I levels (b) in PDAC cells measured by flow 
cytometry. Mouse PDAC cells expressing the GFP-LC3-RFP reporter and 
Dox-inducible mTurquoise2-ATG4B(C74A) were grownas organoids for 8 days 
and treated with Dox (1 1g mI’) for the indicated hours. a, Autophagy flux 
represented by GFP/RFP ratio. Note that increased GFP/RFP ratio indicates 
reduced autophagy flux. b, Cell surface MHC-I (H-2K°) levels. Representative 
flow cytometry plots are shown. Data are mean +s.d.n=3 biological replicates. 
Data are representative of at least four independent experiments. c, Fold 
changes of respective molecules onthe cell surface quantified by flow 
cytometry. HY15549 cells expressing Dox-inducible mTurquoise2-tagged 
ATG4B(C74A) were grownas organoids for 8 days and treated with or 

without Dox (1g mI") for 72h. Positive surface expression of each molecule 
was confirmed using respective isotype controls. Molecules found in 
immunological synapses are underlined. TFRC, transferrin receptor. Data are 
meants.d.n=4 biological replicates. Representative data from two 


independent experiments are shown. d-f, Mouse PDAC cells expressing OVA 
and carrying Dox-inducible mTurquoise2-ATG4B(C74A) were grownas 
organoids and treated with or without Dox (1 pg mI) for 96h. Related 

to Fig. 3a, b. d, Autophagy inhibition was confirmed by immunoblot. 
mTurquoise2-ATG4B(C74A) or endogenous ATG4B were detected by 
anti-ATG4B antibody. e, f, Flow cytometry plots for H-2K° (e) and 
H-2K°-SIINFEKL (f). Representative plots from Fig. 3a, b are shown. Grey, 
isotype control. g, Representative flow cytometry plots of the OT-I cells 
co-cultured with mouse PDAC cells from Fig. 3c. h, Quantitative reverse 
transcription PCR (qRT-PCR) analysis of OT-I cells that were co-cultured with 
PDAC cells for 48 h. Related to Fig. 3c. Data are mean+s.d.n=3 biological 
replicates. For gandh, Dox(+) or Dox(-) indicates that PDAC cells were 
grown with or without Dox (1 pg mI”) before co-culture. Dox was not addedin 
co-culture. A representative of at least three independent experiments is 
shownin d-g. ****P< 0.0001, unpaired two-tailed t-tests. See Supplementary 
Fig. 1for gel source data. 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6| Autophagy inhibition modulates anti-tumour 
immunity in both orthotopictumours and liver metastasis. a, Immunoblots 
showing autophagy inhibition in mSt-ATG4B(C74A)-expressing cells. Mouse 
PDAC cells carrying Dox-inducible mSt or 4B were treated with Dox (1g mI) 
for the indicated days. mSt or mSt-ATG4B(C74<A) was detected by anti-RFP 
antibody. A representative of two independent experiments is shown. b-j, 
Related to Fig. 3e-h. Mouse PDAC cells shown ina were orthotopically 
transplanted into syngeneic (C57BL/6) mice. HY15549 cells (mSt,n=8;4B,n=7) 
and HY19636 cells (n=8 per group) were injected. b, Study design. c, Images of 
tumours at end point. d~g, HY19636 tumour weight (d), cell surface MHC-I 
levels (e) and PD-L1 levels (f) on PDAC cells and tumour-infiltrating CD8* T cells 
(g) measured by flowcytometry. h, i, Representative H&E staining (h) and 
immunofluorescent staining (i) of HY15549 tumours (mSt, n=8;4B,n=7). Scale 
bars, 100 pm.j, Quantification of tumour-infiltrating immune cells by flow 
cytometry (HY15549, n=8 and 7; HY19636, n=8 per group). Gating strategies 
are shown in Extended Data Fig. 7k and Supplementary Table 2. MDSC, 
myeloid-derived suppressor cells; Treg, T regulatory cells; TAM, 
tumour-associated macrophages. k-o, Autophagy-inhibition by 
shRNA-mediated ATG7 knockdown elicits similar anti-tumour T cell responses. 


k, Immunoblots for ATG7, LC3 and B-actin in PDAC cells (HY15549) expressing 
shRNAs against GFP or ATG7. A representative of at least two independent 
experiments is shown. I-o, Mouse PDAC cells shown ink were orthotopically 
transplanted into syngeneic mice (n=7 per group). I, Images of tumours 
collected on day 22. m, Tumour weight. n, Tumour-infiltrating immune cells as 
measured by flowcytometry. o, Correlation between CD8' T cell frequency 
among CD45‘ cells and tumour weight. p-t, Related to Fig. 3i-l. Autophagy 
inhibition modulates anti-tumour immunity in metastatic tumours in the liver. 
Mouse PDAC cells (HY15549) carrying Dox-inducible mSt or 4B were injected 
into the spleen of syngeneic (C57BL/6) mice that were pre-fed witha 
Dox-containing diet (n=4 per group). PDAC cells were pre-treated with Dox 
(lpg mI‘) for 7 days before injection. p, Study design. q, Images of the liver. r,s, 
Representative images of H&E staining (r) and immunofluorescent staining (s) 
(n=4 per group). Scale bars, 200 pm (r) and 100 pm (s). t, Quantification of 
immune cells in the liver metastasis as measured by flowcytometry. Data are 
mean +s.e.m. n indicates individual mice. Pvalues were determined by 
unpaired two-tailed t-tests (d-g,j, m, n, t) and Pearson correlation analysis (0). 
See Supplementary Fig. 1 for gel source data. 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7| Tumour regression after autophagy inhibition is 
rescued by depletion of CD8' T cells or ablation of cell surface MHC-I.a-c, 
Related to Fig. 3m. HY15549 cells with Dox-inducible mSt or 4B were 
orthotopically injected into C57BL/6 mice and fed with Dox-containing diet 
starting on day 5, and then received intraperitoneal injection of anti-CD8 or 
isotype control IgG (n=7 per group). a, Study design. b, Images of tumours. c, 
Tumour-infiltrating leukocytes as quantified by flow cytometry. d-f, Related to 
Fig. 3n, o. HY15549 cells with Dox-inducible mSt or 4B were orthotopically 
injected into C57BL/6 mice (WT) or Batf3~ mice (KO) (n=8,4,8 and 5).d, Study 
design. e, Images of tumours. f, Quantification of tumour-infiltrating 
leukocytes by flow cytometry. g-j, Related to Fig. 3p-r. HY15549 cells carrying 


Dox-inducible mSt or 4B were stably transfected with lentiviral vectors 
expressing control shRNA (shScr, solid line) or shRNA against B2m (shB2m, 
dashed line). g, Cell surface MHC-Ias measured by flow cytometry. Cells were 
treated with IFNy (200 U mI") for 24 hbefore flow cytometry analysis. 
Representative data from three independent experiments are shown. h-j, Cells 
shown ing (4 x 10‘ cells) were orthotopically transplanted into syngeneic 
(C57BL/6) mice that were pre-fed with Dox diet (n=8 per group). h, Study 
design. i, Images of tumours.j, Tumour-infiltrating leukocytes as quantified by 
flow cytometry. k, Gating strategies for flow cytometry analysis of tumours 
used inthis study. See also Supplementary Table 2. Dataare mean+s.e.m.n 
indicates individual mice. Pvalues determined by unpaired two-tailed tests. 
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Extended Data Fig. 8 | Separation of PDAC cells with distinct autophagy 
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pathways enriched in AThi cells compared to ATlocells. Gene set enrichment 


analysis (GSEA) was performed using RNA sequencing (RNA-seq) data from 
sorted AThiand ATlo cells (n=2 and 3 biologically independent samples), 
showing enrichment of the autophagy-lysosome gene signatures in AThi cells 


flux using the GFP-LC3-RFP reporter. Heterogeneity in basal autophagy flux 
was explored using mouse PDAC cells (HY15549) expressing the GFP-LC3-RFP 
reporter. a, b, HY15549 cells were grown as organoids or transplanted into 


C57BL/6 mice to form orthotopic tumours. a, Autophagy flux, as represented 
by GFP/RFP ratio, was measured by flow cytometry. Atg5~ mouse embryonic 


fibroblasts (MEF) with the GFP-LC3-RFP reporter (black) was used as acontrol. 


Representative flow plots from three independent experiments are shown. b, 
Representative fluorescent images of orthotopic tumours. Cells with high 
autophagy flux show GFP-LC3 puncta formation (inset, arrowhead) anda 
decrease in total GFP-fluorescent signals, displaying red appearance inthe 
merged image. Scale bars, 100 pm. c-h, Mouse PDAC organoids were 
dissociated into single cells and sorted into autophagy-high (AThi) or -low 
(ATlo) cells according to the GPF/RFP ratio. c, Sorting strategies. d, Top KEGG 


as compared with the ATlo cells. FDR, false discovery rate; NES, normalized 
enrichment score; Nom., nominal. e, f, Relative mRNA expression of 
autophagy/lysosome-related genes in the respective populations sorted from 
pooled populations (e) or asingle-cell derived clone (f). n=3 technical 
replicates. Representative results from four (e) and two (f) independent 
sorting experiments are shown. g-i, Clonogenic potential of sorted AThi and 
ATlocells (g, h) and PDAC cells with Dox-inducible ATG4B(C74A) (AY6284) (i). 
Representative data from at least two independent experiments are shown. 
n=4(g) andn=3 (i) per group. Data are mean ¢+s.d. (e-i). Pvalues were 
determined by unpaired two-tailed t-tests. 
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Extended Data Fig. 9 | Basal autophagy flux determines immunogenicity of 
PDAC cells. Mouse PDAC cells (HY15549) expressing the GFP-LC3-RFP 
reporter were sorted into AThiand ATlocells (Extended Data Fig. 8c) and 
injected into the pancreas (a-g) or the spleen (h-k) of C57BL/6 mice (a-f, h-k) 
or nude mice (g). Cells were sorted from pooled populations except for (b). a, b, 
Tumour weight on day 21. AThiand ATlo cells were sorted from either pooled 
populations (a) (n=9 and 10) ora single-cell derived clone (b) (n=10 per group). 
c-e, Tumours shown ina were analysed. c, Cell surface MHC-I levels on PDAC 
cells measured by flow cytometry. d, Correlation between MHC-I levels on 
PDAC cells and tumour weight. e, Representative images of H&E (left) and 
immunofluorescent staining (right). Scale bars, 100 pm. f, Quantification of 
tumour-infiltrating immune cells by flowcytometry (n=8 per group). 
Orthotopic tumours obtained on day 21 were analysed. g, Orthotopic tumours 


in nude mice obtained on day 19 (n=8 and 7). h-k, Liver metastasis model.h, 
Study design. i,j, Weight of livers (i) and cell surface MHC-I levels on PDAC cells 
measured by flowcytometry (j) on day 17 (n=5 per group). k, Representative 
immunofluorescence images of livers obtained on day 9 (n=3 per group). 
Frozen sections were stained with anti-CD8a antibody and DAPI. In these 
merged images, cells with high autophagy flux appear as red, reflecting the 
relative loss of GFP-fluorescence and lower GFP/RFP ratio, whereas cells with 
low autophagy flux appear as yellow to green, reflecting high GFP/RFP ratio. In 
the enlarged images, CD8a’ cells were indicated by white arrowheads. Scale 
bars, 100 pm. Fora, c-fandi-k, experiments were performed atleast twice and 
representative data of one experiment are shown. Data are mean+s.e.m. (a-c, 
f,g, i,j). nindicates individual mice. Pvalues were determined by unpaired 
two-tailed t-tests (a-c, f, g, i,j) and Pearson correlation analysis (d). 
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Extended Data Fig. 10| See next page for caption. 
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Extended Data Fig. 10 | Autophagy inhibition synergizes with dual ICB.a-c, 
Anti-PD1 antibody treatment did not affect tumour growth in either control or 
autophagy-inhibited tumours. Mice bearing orthotopic PDAC tumours 
(HY15549) carrying Dox-inducible mSt or 4B were treated with Dox beginning 
on day Sand received either isotype control IgG or anti-PD1 antibody (n=7,8,8 
and7 per group). a, Study design. b, c, Images (b) and weight (c) of tumours. d- 
g, Related to Fig. 4a—d. Mice bearing orthotopic PDAC tumours (HY15549) 
expressing Dox-inducible mSt or 4B were treated with Dox beginning on day 5 
and received either isotype control IgG or dual ICB (anti-PD1/CTLA4 
antibodies) (n=7 per group). d, e, Representative images of 
immunofluorescence staining (d) and H&E staining (e). Scale bars, 100 um. f, g, 
Quantification of tumour-infiltrating immune cells by flow cytometry. h, Cell 


surface MHC-I (H-2K°/D°) levels measured by flow cytometry. Mouse PDAC cells 


were treated with chloroquine or BafA1at the indicated concentrations for 48h 
(n=4). Mouse PDAC cells were grown in 2D culture (chloroquine) or as 


organoids (BafA1). Representative results from at least three independent 
experiments are shown. i-m, Mice bearing orthotopic PDAC tumours 
expressing the GFP-LC3-RFP reporter were treated with PBS or chloroquine 
beginning on day 5 (n=7 vs 6 for HY15549 and n= 8 vs 8 for HY19636). i, Study 
design.j, Images of tumours. k, Cell-surface MHC-I and PD-L1 levels on PDAC 
cells measured by flowcytometry. I, Tumour weight. m, Quantification of 
tumour-infiltrating immune cells by flow cytometry. n, Representative 
fluorescence images of tumours expressing the GFP-LC3-RFP reporter from 
Fig. 4j. Numerical values represent mean fluorescent intensity of each field. o, 
Quantification of tumour-infiltrating immune cells by flowcytometry (n=8,8, 
8 and 5; left to right). Tumours in Fig. 4k were analysed. Data are mean+s.e.m. 
(c, f, g,k-m, o) or +s.d. (h). nindicates individual mice (c, f, g, k-m, 0) or 
biological replicates (h). Except for the orthotopic implantation of HY19636 
cells (j-m), all experiments were performed at least twice and representative 
data of one experiment are shown. ****P< 0.0001, unpaired two-tailed f-tests. 
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For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Flow cytometry data were collected using FACSDiva (BD Biosciences, version 8.0.1). Quantitative PCR data was obtained with the Bio-Rad 
CFX96 Real-Time System and Bio-Rad CFX Manager (BIO-RAD, version 3.1.1517.0823). Fluorescent images were obtained using Leica 
Application Suite X software (Leica, version 2.0.0.14332.2). Immunoblotting images were obtained using ChemiDoc Touch Imaging 
System (BIO-RAD) and Image Lab Touch Software (BIO-RAD, version 2.3.0.07). 


Data analysis Flow cytometry data were analysed using FlowJo (FlowJo, version 10.4). Fluorescent images were analyzed using FUI/ImageJ (version 
1.52p) or Leica Application Suite X software (Leica, version 2.0.0.14332.2). Gene Set Enrichment Analysis was conducted using GSEA 
software (version 3.0) with gene sets derived from the KEGG pathway database in the MSigDB collections. Statistical analysis was 
performed using Graphpad Prism (GraphPad Software, version 7). For proteomics analysis, raw files were converted to mzXML using Raw 
File Reader (Thermo Fisher Scientifc, version 3.0.77) and spectra were searched using the SEQUEST software package. Search results 
were filtered using the Linear Discriminant Analysis (LDA) function in MASS Package in R as described in Huttlin et al. Cell 143, 1174-1189 
(2010). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


O 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 
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RNA-sequencing data are deposited to the Gene Expression Omnibus (GEO) data repository with accession number GSE145766. Source Data are provided for all 


experiments. Other data that support the findings of this study are available on request from the corresponding author upon reasonable request. 


Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


DX] Life sciences 


[_] Behavioural & social sciences [_] Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life scien 


ces study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size 


Data exclusions 


Replication 


Randomization 


Blinding 


Behaviou 


No sample size calculation was done either for in vitro or in vivo studies. For in vivo studies, sample sizes were determined based on our 
preliminary experiments. In our experience, n = 7-10 mice per group is sufficient to detect meaningful biological differences with good 
reproducibility. 


No data or animals were excluded from analysis. 


Experiments were repeated multiple times with similar results as indicated in the figure legends. All critical in vivo experiments were 
reproduced using the same system (e.g. repeat of autophagy inhibition in the same cell line multiple times using the same in vivo assays) and/ 
or using orthogonal approaches (e.g. repeated in multiple cell lines — HY19636, another C57BI/6-background mouse PDAC cell line; repeated 
in independent tumor models - orthotopic vs metastasis model; repeated with alternative approaches to decrease autophagy — Atg4b DN, 
shRNAs to Atg7, autophagy high/low populations, pharmacological - CQ; repeated using independent methods of impairing CD8 T-cells — CD8 
depleting antibodies, immunodeficient nude mice, Batf3 KO mice). Additionally, we reproduced the critical autophagy inhibition experiment 
with MHC-! upregulation and T-cell infiltration in a second mouse facility. 


Prior to treatment initiation, mice bearing tumours were randomized. Tumour sizes were not measured at the time of randomization. 


Blinding was not performed in mouse experiments. The investigator needed to know the treatment groups in order to perform the study. 
Tumour weights (an objective measurement) were carried out only at the study endpoints after mice were euthanized and tumours were 
harvested. 


ral & social sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description 


Research sample 


Sampling strategy 


Data collection 


Timing 


Data exclusions 


Non-participation 


Randomization 


Briefly describe the study type including whether data are quantitative, qualitative, or mixed-methods (e.g. qualitative cross-sectional, 
quantitative experimental, mixed-methods case study). 
State the research sample (e.g. Harvard university undergraduates, villagers in rural India) and provide relevant demographic information 
(e.g. age, sex) and indicate whether the sample is representative. Provide a rationale for the study sample chosen. For studies involving 
existing datasets, please describe the dataset and source. 


Describe the sampling procedure (e.g. random, snowball, stratified, convenience). Describe the statistical methods that were used to 
predetermine sample size OR if no sample-size calculation was performed, describe how sample sizes were chosen and provide a rationale 
for why these sample sizes are sufficient. For qualitative data, please indicate whether data saturation was considered, and what criteria 
were used to decide that no further sampling was needed. 


Provide details about the data collection procedure, including the instruments or devices used to record the data (e.g. pen and paper, 
computer, eye tracker, video or audio equipment) whether anyone was present besides the participant(s) and the researcher, and whether 
the researcher was blind to experimental condition and/or the study hypothesis during data collection 


Indicate the start and stop dates of data collection. If there is a gap between collection periods, state the dates for each sample cohort. 


If no data were excluded from the analyses, state so OR if data were excluded, provide the exact number of exclusions and the rationale 
behind them, indicating whether exclusion criteria were pre-established. 


State how many participants dropped out/declined participation and the reason(s) given OR provide response rate OR state that no 
participants dropped out/declined participation. 


If participants were not allocated into experimental groups, state so OR describe how participants were allocated to groups, and if 
allocation was not random, describe how covariates were controlled. 
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Ecological, evolutionary & environmental sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description 


Research sample 


Sampling strategy 


Data collection 


Timing and spatial scale 


Data exclusions 


Reproducibility 


Randomization 


Blinding 


Did the study involve field work? 


Briefly describe the study. For quantitative data include treatment factors and interactions, design structure (e.g. factorial, nested, 
hierarchical), nature and number of experimental units and replicates. 


Describe the research sample (e.g. a group of tagged Passer domesticus, all Stenocereus thurberi within Organ Pipe Cactus National 
Monument), and provide a rationale for the sample choice. When relevant, describe the organism taxa, source, sex, age range and 
any manipulations. State what population the sample is meant to represent when applicable. For studies involving existing datasets, 
describe the data and its source. 


Note the sampling procedure. Describe the statistical methods that were used to predetermine sample size OR if no sample-size 
calculation was performed, describe how sample sizes were chosen and provide a rationale for why these sample sizes are sufficient. 


Describe the data collection procedure, including who recorded the data and how. 
Indicate the start and stop dates of data collection, noting the frequency and periodicity of sampling and providing a rationale for 
these choices. If there is a gap between collection periods, state the dates for each sample cohort. Specify the spatial scale from which 


the data are taken 


If no data were excluded from the analyses, state so OR if data were excluded, describe the exclusions and the rationale behind them, 
indicating whether exclusion criteria were pre-established. 


Describe the measures taken to verify the reproducibility of experimental findings. For each experiment, note whether any attempts to 
repeat the experiment failed OR state that all attempts to repeat the experiment were successful. 


Describe how samples/organisms/participants were allocated into groups. If allocation was not random, describe how covariates were 
controlled. If this is not relevant to your study, explain why. 


Describe the extent of blinding used during data acquisition and analysis. If blinding was not possible, describe why OR explain why 
blinding was not relevant to your study. 


[| Yes [| No 


Field work, collection and transport 


Field conditions 


Location 


Access and import/export 


Disturbance 


Describe the study conditions for field work, providing relevant parameters (e.g. temperature, rainfall). 


State the location of the sampling or experiment, providing relevant parameters (e.g. latitude and longitude, elevation, water 
depth). 


Describe the efforts you have made to access habitats and to collect and import/export your samples in a responsible manner and 
in compliance with local, national and international laws, noting any permits that were obtained (give the name of the issuing 


authority, the date of issue, and any identifying information). 


Describe any disturbance caused by the study and how it was minimized. 
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Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 

system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 
Materials & experimental systems Methods 

n/a | Involved in the study n/a 

Antibodies | 


Involved in the study 
ChIP-seq 


Eukaryotic cell lines Flow cytometry 


Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used 


All antibodies, their supplier, and the dilution used is reported in detail in the methods section and Supplementary Table 1. 


mmunofluorescence (IF) (Target molecule, target species, host species, clone, conjugation, dilution, company, catalog number, 
Lot number) 

LAMP1 Human Rabbit D2D11 unconjugated 1:500 Cell Signaling Technology 9091 Lot 6 

HLA-A, B, C Human Mouse W6/32 unconjugated 1:250 Sigma-Aldrich H1650 098M4897V 

LC3B Human Rabbit D11 unconjugated 1:500 Cell Signaling Technology 3868 Lot 13 

gG secondary Human Rabbit N/A AF555 1:400 ThermoFisher Scientific A-21429 1937155 

gG secondary Human Mouse N/A AF488 1:250 ThermoFisher Scientific A-11029 TJ269844 


mmunoblotting (IB) (Target molecule, target species, host species, clone, dilution, company, catalog number, Lot number) 
LA-A, B, C Human Mouse EMR8-5 1:2000 Abcam ab70328 GR3266685-1 

LA-A, B Human Rabbit EPR1394Y 1:1000 Abcam ab134189 GR119003-11 

LA-A Human Rabbit EP1395Y 1:2000 Abcam ab52922 GR258732-23 

LA-B Human Rabbit polyclonal 1:1500 ThermoFisher Scientific PAS-35345 UB2717115 

LA-C Human Rabbit EPR6749 1:2500 Abcam ab126722 GR97813-8 

iemann Pick C1 Human Rabbit EPR5209 1:3000 Abcam ab134113 GR3230423-7 

GAPDH Human Mouse 6C5 1:1000 Santa Cruz sc-32233 E2419 

Alpha Tubulin Human Mouse 1£4C11 1:5000 Proteintech 66031-1-lg Lot# not available 
LAMP1 Human Mouse D2D11 1:1000 Cell Signaling Technology 9091 Lot 6 

ATG3 Human Rabbit polyclonal 1:1000 Cell Signaling Technology 3415 Lot 2 

ATG7 Human Rabbit D12B11 1:1000 Cell Signaling Technology 8558 Lot 3 

RB1CC1 (FIP200) Human Rabbit polyclonal 1:750 Protein-Tech 17250-1-AP Lot# not available 
ULK1 Mouse Rabbit D8H5 1:1000 Cell Signaling Technology 8054 Lot 6 

Rubicon Mouse Rabbit D9F7 1:1000 Cell Signaling Technology 8465 Lot 2 

ATG4B Mouse Rabbit polyclonal 1:1000 Cell Signaling Technology 5299 Lot 2 

NBR1 Human Mouse 4BR 1:500 Santa Cruz sc-130380 LO219 

Optineurin Human Rabbit polyclonal 1:500 Abcam ab23666 GR271296-15 

TAX1BP1 Human Rabbit polyclonal 1:1000 Sigma-Aldrich HPAO24432 B118261 
CALCOCO2/NDP52 Human Rabbit polyclonal 1:1000 Sigma-Aldrich HPAO23195 A65497 
p62/SQSTM1 Human Guinea Pig polyclonal 1:1000 Progen GP62-C DS-190318 

ATG7 Mouse Rabbit polyclonal 1:1000 Sigma-Aldrich A2856 076M4824V 

LC3B Mouse Rabbit polyclonal 1:1000 Novus Biologicals NB100-220 EM 

RFP/mStrawberry Rabbit polyclonal 1:1000 Rockland 600-401-379 35055 

B-actin Mouse Mouse AC-15 1:1000 Sigma-Aldrich A5441 127M4866V 

ono- and polyubiquitinated conjugates Human Mouse FK2 1:1000 Enzo BML-PW8810-0100 1031904 


iL LL 


mmunohistochemistry (IHC) (Target molecule, target species, host species, clone, conjugation, dilution, company, catalog 
number, Lot number) 

CK-19 Human Rat TROMA-III unconjugated 1:300 DSHB Lot# not available 

HLA-A,B Human Rabbit EPR1394Y unconjugated 1:300 abcam ab134189 GR119003-11 
pan-Cytokeratin Mouse Mouse C-11 AF488 1:200 BioLegend 628608 B264191 

CD8a Mouse Rat 4SM15 unconjugated 1:100 Fisher 14-0808-80 2144532 
RFP/mStrawberry N/A Rabbit polyclonal unconjugated 1:400 Rockland 600-401-379 35055 
G 

e 

e 

R 

R 


D3 Mouse Rat 17A2 unconjugated 1:100 BioLegend 100201 B229443 

D8a Mouse Rat 53-6.7 unconjugated 1:100 BioLegend 100701 B267129 

D8a Mouse Rat 53-6.7 AF647 1:100 BioLegend 100727 B262298 

at IgG Rat Goat polyclonal AF647 1:400 Thermo Fisher A-21247 2119156 

abbit IgG Rabbit Goat polyclonal AF555 1:400 Thermo Fisher A-21428 1937155 


Flow Cytometry (FCM) (Target molecule, target species, host species, clone, conjugation, dilution, company, catalog number, Lot 
number) 

HLA-A, B, C Human Mouse W6/32 AF488 1:75 BioLegend 311413 B256489 

HLA-A, B, C Human Mouse W6/32 PE 1:75 BioLegend 311406 B265864 
Cc 
e 


D4 Mouse Rat GK1.5 BV605 1:100 BioLegend 100451 B258998 

D45 Mouse Rat 30-F11 BUV395 1:100 BD Biosciences 564279 9171873 

Foxp3 Mouse Rat FJK-16s eFluor 450 1:25 eBioscience 48-5773-82 1988692 

D3 Mouse Rat 17A2 BV711 1:100 BioLegend 100241 B265870 

220 Mouse Rat RA3-6B2 APC 1:100 BioLegend 103212 B242829 

1.1 Mouse Mouse PK136 APC/Cy7 1:100 BioLegend 108723 B261757 

D11b Mouse Rat M1/70 PE/Cy7 1:100 BioLegend 101215 B268266 

D11c Mouse Armenian Hamster N418 Pacific 1:100 BioLegend 117321 B257457 
D103 Mouse Armenian Hamster 2E7 AF488 1:100 BioLegend 121407 B276138 
F4/80 Mouse Rat BM8 BV605 1:100 BioLegend 123133 B268952 

Gr1 Mouse Rat RB6-8C5 APC/Cy7 1:100 BioLegend 108423 B271583 

CD8a Mouse Rat 53-6.7 AF700 1:100 BioLegend 100730 B246153 

CD8a Mouse Rat 53-6.7 AF647 1:200 BioLegend 100727 B262298 

HC-II Mouse Rat M5/114.15.2 AF700 1:100 BioLegend 107622 B264454 

PD-1 Mouse Rat 29F.1A12 BV785 1:100 BioLegend 135225 B261641 

Tim3 Mouse Rat RMT3-23 APC 1:100 BioLegend 119705 B265068 

PD-L1 Mouse Rat 10F.9G2 BV711 1:100 BioLegend 124319 B257887 
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Validation 


H-2 Mouse Rat M1/42 FITC 1:100 BioLegend 125507 B247755 

H-2Kb Mouse Mouse AF6-88.5 AF647 1:100 BioLegend 116512 B265968 
H-2Kb/H-2Db Mouse Mouse 28-8-6 AF647 1:100 BioLegend 114612 B253985 

H-2Kb bound to SIINFEKL Mouse Mouse 25-D1.16 APC 1:100 BioLegend 141605 B253603 
H-2Db Mouse Mouse KH95 APC 1:100 BioLegend 111513 B269797 

Transferrin receptor Mouse Rat RI7217 APC 1:100 BioLegend 113819 B264912 

HVEM Mouse Armenian Hamster HMHV-1B18 APC 1:100 BioLegend 136305 B266707 
Galectin-3 Mouse Rat M3/38 APC 1:100 BioLegend 125419 B272403 

Galectin-9 Mouse Rat RG9-35 APC 1:100 BioLegend 136109 B245181 

E-Cadherin Mouse Rat DECMA-1 AF647 1:100 BioLegend 147307 B220119 

CD44 Mouse Rat IM7 APC 1:100 BioLegend 103012 B265921 

TNFRSF1A Mouse Armenian Hamster 55R-286 APC 1:100 BioLegend 113005 B255749 
EpCAM Mouse Rat G8.8 APC 1:100 BioLegend 118213 B249071 

Fas (CD95) Mouse Mouse SA367H8 APC 1:100 BioLegend 152603 B278865 

DR5 Mouse Armenian Hamster MD5-1 PE 1:100 BioLegend 119905 B260464 

CD155 Mouse Rat TX56 APC 1:100 BioLegend 131509 B284616 

PD-L1 Mouse Rat 10F.9G2 APC 1:100 BioLegend 124311 B241508 


Blocking antibodies for in vitro and in vivo experiments (name, host species, clone, dilution, company, catalog number, Lot 
number). 

nVivoMab anti-mouse CD8a Rat 53-6.7 200 mg/mouse BioXCell BEO004-1 693018M2 

nVivoMAb rat IgG2a isotype control Rat 2A3 200 mg/mouse BioXCell BEOO89 716719J3 

nVivoMAb anti-mouse PD-1 Rat RMP1-14 200 mg/mouse BioXCell BEO146 735019J3 

nVivoMAb polyclonal Syrian hamster IgG Hamster N/A 200 mg/mouse BioXCell BEO087 709119A1 

nVivoMAb anti-mouse CTLA-4 Hamster 9H10 200 mg/mouse BioXCell BEO131 704019A2 

nVivoMAb anti-mouse MHC Class | (H-2Kb) bound to SIINFEKL peptide Rat 25-D1.16 BioXCell BEO207 507816J3 
nVivoMAb mouse IgG1 isotype control Mouse MOPC-21 BioXCell BEO083 701618J2 


All antibodies used are commercially available and validated by the manufacturers. For the proteins listed below, specificities of 
the antibodies were validated using appropriate controls (cells deficient for or overexpressing the target molecules, cells treated 
with reagents known to increase or decrease the protein, or isolated organelles with specific protein expression, etc.): LAMP1, 
LC3B, ATG3, ATG7, RB1CC1 (FIP200), ULK1, Rubicon, ATG4B, NBR1, RFP, mStrawberry, mono- and polyubiquitinated proteins, 
H-2, H-2Kb, H-2Db, H-2Kb/Db, and H-2Kb bound to SIINFEKL. For flow cytometry, fluorescence minus one (FMO) controls and 
isotype controls were used to accurately determine the thresholds. 


Additional information can be found on the manufactures’ websites: (Target Antigen, Company, Catalog #, URL) 


mmunofluorescence (IF) 
LAMP1 Cell Signaling Technology 9091 https://www.cellsignal.com/products/primary-antibodies/lamp1-d2d11-xp-rabbit- 
mab/9091 

LA-A, B, C Sigma-Aldrich H1650 https://www.sigmaaldrich.com/catalog/product/sigma/h1650 

LC3B Cell Signaling Technology 3868 https://www.cellsignal.com/products/primary-antibodies/lc3b-d11-xp-rabbit-mab/3868 


mmunoblotting (IB) 

HLA-A, B, C Abcam ab70328 https://www.abcam.com/hla-class-1-abc-antibody-emr8-5-ab70328.html 

HLA-A, B Abcam ab134189 https://www.abcam.com/mhc-class-i--hla-a--hla-b-antibody-epr1394y-ab134189.html 

HLA-A Abcam ab52922 https://www.abcam.com/hla-a-antibody-ep1395y-ab52922.html 

HLA-B ThermoFisher Scientific PA5-35345 https://www.thermofisher.com/antibody/product/HLA-B-Antibody-Polyclonal/ 
PA5-35345 

HLA-C Abcam ab126722 https://www.abcam.com/hla-c-antibody-epr6749-ab126722.html 

iemann Pick C1 Abcam ab134113 https://www.abcam.com/niemann-pick-c1-antibody-epr5209-ab134113.html 

GAPDH Santa Cruz sc-32233 https://www.scbt.com/p/gapdh-antibody-6c5;jsessionid=O9eiHtvkm7s7v2yLUzMIb8KDQ3CExutf- 
M9rXjPj8JoNEVczZjR!-1044019884!2132931255 

Alpha Tubulin Proteintech 66031-1-lg https://www.ptglab.com/products/tubulin-Alpha-Antibody-66031-1-lg.htm 

LAMP1 Cell Signaling Technology 9091 https://www.cellsignal.com/products/primary-antibodies/lamp1-d2d11-xp-rabbit- 
mab/9091 
ATG3 Cell Signaling Technology 3415 https://www.cellsignal.com/products/primary-antibodies/atg3-antibody/3415 

ATG7 Cell Signaling Technology 8558 https://www.cellsignal.com/products/primary-antibodies/atg7-d12b11-rabbit-mab/8558 
RB1CC1 (FIP200) Protein-Tech 17250-1-AP https://www.ptglab.com/Products/RB1CC1-Antibody-17250-1-AP.htm 

ULK1 Cell Signaling Technology 8054 https://www.cellsignal.com/products/primary-antibodies/ulk1-d8h5-rabbit-mab/8054 
Rubicon Cell Signaling Technology 8465 https://www.cellsignal.com/products/primary-antibodies/rubicon-d9f7-rabbit-mab/8465 
ATG4B Cell Signaling Technology 5299 https://www.cellsignal.com/products/primary-antibodies/atg4b-antibody/5299 

NBR1 Santa Cruz sc-130380 https://www.scbt.com/p/nbr1-antibody-4br?productCanUrl=nbr1- 
antibody-4br&_requestid=2564925 

Optineurin Abcam ab23666 https://www.abcam.com/optineurin-antibody-ab23666.html 

TAX1BP1 Sigma-Aldrich HPAO24432 https://www.sigmaaldrich.com/catalog/product/sigma/hpa024432 

CALCOCO2/NDP52 Sigma-Aldrich HPAO23195 https://www.sigmaaldrich.com/catalog/product/sigma/hpa023195 

p62/SQSTM1 Progen GP62-C https://www.progen.com/anti-p62-sqstm1-c-terminus-guinea-pig-polyclonal-serum.html 

ATG7 Sigma-Aldrich A2856 https://www.sigmaaldrich.com/catalog/product/sigma/a2856 

LC3B Novus Biologicals NB100-220 https://www.novusbio.com/products/lc3b-antibody_nb100-2220 

RFP/mStrawberry Rockland 600-401-379 https://rockland-inc.com/store/Antibodies-to-GFP-and-Antibodies-to- 
RFP-600-401-379-O4L_24299.aspx 

B-actin Sigma-Aldrich A5441 https://www.sigmaaldrich.com/catalog/product/sigma/a5441 

Mono- and polyubiquitinated conjugates Enzo BML-PW8810-0100 https://www.enzolifesciences.com/BML-PW8810/mono-and- 
polyubiquitinylated-conjugates-monoclonal-antibody-fk2/ 
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Eukaryotic cell lines 


Immunohistochemistry (IHC) 

-19 DSHB TROMA-III https://dshb.biology.uiowa.edu/TROMA-III 

HLA-A,B abcam ab134189 https://www.abcam.com/mhc-class-i--hla-a--hla-b-antibody-epr1394y-ab134189.html 
pan-Cytokeratin BioLegend 628608 https://www.biolegend.com/en-us/products/alexa-fluor-488-anti-cytokeratin-pan-reactive- 
a 

Cc 


Qa 


ntibody-13383 

D8a Fisher 14-0808-80 https://www.thermofisher.com/antibody/product/CD8a-Antibody-clone-4SM15- 
onoclonal/14-0808-82 

FP/mStrawberry Rockland 600-401-379 https://rockland-inc.com/store/Antibodies-to-GFP-and-Antibodies-to- 
FP-600-401-379-O4L_24299.aspx 

D3 BioLegend 100201 https://www.biolegend.com/en-us/products/purified-anti-mouse-cd3-antibody-48 

D8a BioLegend 100701 https://www.biolegend.com/en-us/products/purified-anti-mouse-cd8a-antibody-157 

D8a BioLegend 100727 https://www.biolegend.com/en-us/products/alexa-fluor-647-anti-mouse-cd8a-antibody-2699 
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Flow Cytometry (FCM) 

HLA-A, B, C BioLegend 311413 https://www.biolegend.com/en-us/products/alexa-fluor-488-anti-human-hla-a-b-c-antibody-2899 
HLA-A, B, C BioLegend 311406 https://www.biolegend.com/en-us/products/pe-anti-human-hla-a-b-c-antibody-1872 

CD4 BioLegend 100451 https://www.biolegend.com/en-us/products/brilliant-violet-605-anti-mouse-cd4-antibody-10708 

CD45 BD Biosciences 564279 https://www.bdbiosciences.com/us/reagents/research/antibodies-buffers/immunology-reagents/ 
anti-mouse-antibodies/cell-surface-antigens/buv395-rat-anti-mouse-cd45-30-f11/p/564279 

Foxp3 eBioscience 48-5773-82 https://www.thermofisher.com/antibody/product/FOXP3-Antibody-clone-FJK-16s- 
onoclonal/48-5773-82 

D3 BioLegend 100241 https://www.biolegend.com/en-us/products/brilliant-violet-711-anti-mouse-cd3-antibody-10022 

220 BioLegend 103212 https://www.biolegend.com/en-us/products/apc-anti-mouse-human-cd45r-b220-antibody-442 

1.1 BioLegend 108723 https://www.biolegend.com/en-us/products/apc-cy7-anti-mouse-nk-1-1-antibody-4002 

D11b BioLegend 101215 https://www.biolegend.com/en-us/products/apc-cy7-anti-mouse-nk-1-1-antibody-4002 

D11c BioLegend 117321 https://www.biolegend.com/en-us/products/pacific-blue-anti-mouse-cd11c-antibody-3864 

D103 BioLegend 121407 https://www.biolegend.com/en-us/products/alexa-fluor-488-anti-mouse-cd103-antibody-3575 

F4/80 BioLegend 123133 https://www.biolegend.com/en-us/products/brilliant-violet-605-anti-mouse-f4-80-antibody-8702 

Gri BioLegend 108423 https://www.biolegend.com/en-us/products/apccyanine7-anti-mouse-ly-6gly-6c-gr-1-antibody-3935 
CD8a BioLegend 100730 https://www.biolegend.com/en-us/products/alexa-fluor-700-anti-mouse-cd8a-antibody-3387 

CD8a BioLegend 100727 https://www.biolegend.com/en-us/products/alexa-fluor-647-anti-mouse-cd8a-antibody-2699 

HC-II BioLlegend 107622 https://www.biolegend.com/en-us/products/alexa-fluor-700-anti-mouse-i-a-i-e-antibody-3413 

PD-1 BioLegend 135225 https://biolegend.com/en-us/products/brilliant-violet-785-anti-mouse-cd279-pd-1-antibody-9874 

Tim3 BioLegend 119705 https://www.biolegend.com/en-us/products/apc-anti-mouse-cd366-tim-3-antibody-8238 

PD-L1 BioLegend 124319 https://www.biolegend.com/en-us/products/brilliant-violet-711-anti-mouse-cd274-b7-h1--pd-|1- 
antibody-9808 

H-2 BioLegend 125507 https://www.biolegend.com/en-us/products/fitc-anti-mouse-h-2-antibody-5346 

H-2Kb BioLegend 116512 https://www.biolegend.com/en-us/products/alexa-fluor-647-anti-mouse-h-2kb-antibody-3238 
-2Kb/H-2Db BioLegend 114612 https://www.biolegend.com/en-us/products/alexa-fluor-647-anti-mouse-h-2kb-h-2db- 
antibody-3222 

H-2Kb bound to SIINFEKL BioLegend 141605 https://www.biolegend.com/en-us/products/apc-anti-mouse-h-2kb-bound-to- 
siinfekl-antibody-7882 

H-2Db BioLegend 111513 https://www.biolegend.com/en-us/products/apc-anti-mouse-h-2d-b-antibody-11947 

Transferrin receptor BioLegend 113819 https://www.biolegend.com/en-us/products/apc-anti-mouse-cd71-antibody-15498 
HVEM BioLegend 136305 https://www.biolegend.com/en-us/products/apc-anti-mouse-cd270-hvem-antibody-6419 

Galectin-3 BioLegend 125419 https://www.biolegend.com/en-us/products/apc-anti-mouse-human-mac-2-galectin-3- 
antibody-15145 
Galectin-9 BioLegend 136109 https://www.biolegend.com/en-us/products/apc-anti-mouse-galectin-9-antibody-8239 
E-Cadherin BioLegend 147307 https://www.biolegend.com/en-us/products/alexa-fluor-647-anti-mouse-human-cd324-e- 
cadherin-antibody-10024 
CD44 BioLegend 103012 https://www.biolegend.com/en-us/products/apc-anti-mouse-human-cd44-antibody-312 

TNFRSF1A (CD120a) BioLegend 113005 https://www.biolegend.com/en-us/products/apc-anti-mouse-cd120a-tnf-r-type-i-p55- 
antibody-6268 

EpCAM BioLegend 118213 https://www.biolegend.com/en-us/products/apc-anti-mouse-cd326-ep-cam-antibody-4974 

Fas (CD95) BioLegend 152603 https://www.biolegend.com/en-us/products/apc-anti-mouse-cd95-fas-antibody-13906 

DR5 BioLegend 119905 https://www.biolegend.com/en-us/products/pe-anti-mouse-cd262-dr5--trail-r2-antibody-2668 

CD155 BioLegend 131509 https://www.biolegend.com/en-us/products/apc-anti-mouse-cd155-pvr-antibody-6722 

PD-L1 BioLegend 124311 https://www.biolegend.com/en-us/products/apc-anti-mouse-cd274-b7-h1--pd-l1-antibody-6655 
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Policy information about cell lines 


Cell line source(s) 


Authentication 


The cell lines PaTu-8988T, KP4, MiaPaca2, Panc 2.03, PaTu-8902, Panc1, AsPc1, HupT3, A549 were obtained from the 
American Type Culture Collection (ATCC) or the DSMZ. H441, H358, HCT116 and BEAS-2B were provided by Eric Collisson, 
UCSF. HPDE was provided by Ming Tsao (ref 42). Primary murine PDAC cell lines were established from pancreatic tumours in 
respective GEMMs. HY15549 and HY19636 were derived from female KPC mice (p48-Cre+;KrasLSL-G12D/+;Trp53lox/+) that 
were fully backcrossed into a C57BL/6 background. Atg5+/+ KPC cells were derived from Pdx1-Cre+; KrasLSL-G12D/ 
+;Trp53lox/+;Atg5+/+ mice. Atg5-/- KPC cells were derived from Pdx1-Cre+;KrasLSL-G12D/+;Trp53lox/+;Atg5lox/lox mice (ref 
10). AY6284 cells were derived from p48-Cre+;KrasLSL-G12D/+;Trp53lox/+;RosaLSL-rtTA;mSt-Atg4BC74A mice (ref 12). 


All human cell lines acquired from ATCC or DMSZ were validated by STR profiling, visual inspection, and careful 
maintainenance in a central lab cell bank. 
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Mycoplasma contamination Routine testing for mycoplasma was conducted by PCR. All cell lines tested negative for mycoplasma. 


Commonly misidentified lines No commonly misidentified cell lines were used. 
(See ICLAC register) 


Palaeontology 


Specimen provenance 


Specimen deposition Indicate where the specimens have been deposited to permit free access by other researcher 


Dating methods 


lates 
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Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Female 8-10 week old C57BL/6 mice or NCr nude (B6NTac, Taconic) were purchased from Taconic and used for allograft 
experiments. OT-I transgenic mice (Cat# 003831) and Batf3-/- mice (Cat# 013755) were purchased from Jackson Laboratory. 


Wild animals This study did not involve wild animals. 
Field-collected samples This study did not involve samples collected in the field. 
Ethics oversight All experiments involving laboratory animals were approved by the New York University Institutional Animal Care and Use 


Committee (IACUC) under protocol numbers |A16-00507 and IA16-01331. This is indicated in the methods section. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Human PDAC specimens were obtained from patients who underwent surgical resection of the primary PDAC tumours. Clinical 
characteristics of the patients are described in Supplementary Table 3. 


Recruitment Surgical resection of PDAC patients at UCSF were used for the study under an Institutional Review Board (IRB)-approved protocol 
(18-25787) at UCSF. The study had minimal risk per the IRB protocol and thus informed consent was not necessary. 


Ethics oversight Samples were collected at UCSF under IRB-approved protocol 18-25787. This study was conducted under strict compliance with 
institutional ethical regulations. The study had minimal risk per the IRB protocol and thus informed consent was not necessary. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Clinical data 


Policy information about clinical studies 
All manuscripts should comply with the ICMJE guidelines for publication of clinical research and a completed CONSORT checklist must be included with all submissions. 


Clinical trial registration n/a 
Study protocol n/a 
Data collection n/a 
Outcomes n/a 


ChIP-seq 


Data deposition 


Confirm that both raw and final processed data have been deposited in a public database such as GEO. 


Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks. 


Data access links For "Initial submission" or "Re 
May remain private before publication. provide 


Files in database submission Provide a list of all files available in the database submission 


Genome browser session 
(e.g. UCSC) 


Methodology 


Replicates Describe the experimenta 


Sequencing depth 


Antibodies Describe the antibodies u 
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Peak calling parameters 


Data quality 


Software 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation For surface and intracellular MHC-I staining of human cell lines, cells were stained with Alexa Fluor® 488 anti-human HLA-A, B, C 
antibody (BioLegend, clone W6/32) at a 1:75 dilution for 45 minutes at 4°C in the dark and washed with PBS plus 2% FBS and 
2mM EDTA (FACS buffer). Cells were then fixed and permeabilized prior to staining with PE conjugated anti-human HLA-A, B, C 
antibody (BioLegend, clone W6/32) at a 1:75 dilution for 45 min at room temperature and washed with FACS buffer. For cell 
surface molecule staining of mouse cells, single cell suspensions were washed with FCM buffer (HBSS containing 1% FBS, 1 mM 
EDTA, and 10 mM HEPES) and stained with antibodies (Supplementary Table 1) at 4°C in the dark for 20 min. Dead cells were 
depleted by DAPI staining. 


For the immunophenotyping of tumours, tissues were mechanically minced with scissors, and then digested in DMEM containing 
1 mg/mL Collagenase IV (Gibco), 100 g/mL DNase | (Roche), 1% FBS, 10 mM HEPES, 2% Antibiotic-Antimycotic (ThermoFisher 
Scientific) for 40 min at 37°C in the dark with gentle agitation every 10 min. Digested tissues were then washed twice in DMEM 
containing 10% FBS, filtered through a 40-um nylon mesh strainer (Corning). Cells were suspended in ACK Lysing Buffer 
(ThermoFisher Scientific), incubated for 10 min at 4°C in the dark to remove red blood cells. Cells were washed twice in FCM 
buffer and counted. Cells were stained with Zombie Aqua Fixable Viability Kit (BioLegend) and blocked with anti-mouse CD16/ 
CD32 antibody (Mouse BD Fc Block™, clone 2.4G2, BD Bioscience). One million cells were incubated with appropriate antibodies 
(Supplementary Table 1) diluted in FCM buffer at 4°C in the dark for 40 min. Cells were then washed twice with FCM buffer and 
analyzed or further fixed in phosphate-buffered saline (PBS) containing 0.1 M L-Lysine (Sigma) and 2% paraformaldehyde (Ted 
Pella) at 4°C in the dark overnight. Intracellular staining was performed using Foxp3/Transcription Factor Staining Buffer Set 
(eBioscience). 


Instrument Cells were analyzed on a BD LSR Fortessa or a BD LSR-II UV and analyzed by FlowJo software (FlowJo, LLC, version 10.4). 


Software FACSDiva software (BD Biosciences) was used to collect the data, and the data was analyzed by FlowJo software (FlowJo, LLC, 
version 10.4). 


Cell population abundance The purity of cells after sorting was confirmed by post-sort analyses and was usually > 90%. 


Gating strategy Gating strategies were described in Extended Data Fig. 7 and Supplementary Table 2. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 


Magnetic resonance imaging 


Experimental design 


Design type 


Design specifications 

Behavioral performance measures 
Acquisition 

Imaging type(s) 


Field strength 


Sequence & imaging parameters 


Area of acquisition 


Diffusion MRI 


[_] Used 


Preprocessing 


Preprocessing software 
Normalization 
Normalization template 
Noise and artifact removal 


Volume censoring 


Statistical modeling & inference 


Model type and settings 


Effect(s) tested 


Specify type of analysis: 


Statistic type for inference 
(See Eklund et al. 2016) 


Correction 


Models & analysis 


n/a | Involved in the study 


Graph analysis 


Functional and/or effective connectivity 


Graph analysis 


Whole brain 


Indicate task or resting state; event-related or block design. 


Specify the number of blocks, trials or experimental units per session and/or subject, and specify the length of each trial 
or block (if trials are blocked) and interval between trials. 


State number and/or type of variables recorded (e.g. correct button press, response time) and what statistics were used 


to establish that the subjects were performing the task as expected (e.g. mean, range, and/or standard deviation across 
subjects). 


Specify: functional, structural, diffusion, perfusion. 
Specify in Tesla 


Specify the pulse sequence type (gradient echo, spin echo, etc.), imaging type (EPI, spiral, etc.), field of view, matrix size, 
slice thickness, orientation and TE/TR/flip angle. 


State whether a whole brain scan was used OR define the area of acquisition, describing how the region was determined. 


[_] Not used 


Provide detail on software version and revision number and on specific parameters (model/functions, brain extraction, 
segmentation, smoothing kernel size, etc.). 


If data were normalized/standardized, describe the approach(es): specify linear or non-linear and define image types 
used for transformation OR indicate that data were not normalized and explain rationale for lack of normalization. 


Describe the template used for normalization/transformation, specifying subject space or group standardized space (e.g. 
original Talairach, MNI305, ICBM152) OR indicate that the data were not normalized. 


Describe your procedure(s) for artifact and structured noise removal, specifying motion parameters, tissue signals and 
physiological signals (heart rate, respiration). 


Define your software and/or method and criteria for volume censoring, and state the extent of such censoring. 


Specify type (mass univariate, multivariate, RSA, predictive, etc.) and describe essential details of the model at the first 
and second levels (e.g. fixed, random or mixed effects; drift or auto-correlation). 


Define precise effect in terms of the task or stimulus conditions instead of psychological concepts and indicate whether 
ANOVA or factorial designs were used. 


ROI-based Both 


Specify voxel-wise or cluster-wise and report all relevant parameters for cluster-wise methods. 


Describe the type of correction and how it is obtained for multiple comparisons (e.g. FWE, FDR, permutation or Monte 
Carlo). 


Functional and/or effective connectivity 


Multivariate modeling or predictive analysis 


Report the measures of dependence used and the model details (e.g. Pearson correlation, partial 
correlation, mutual information). 


Report the dependent variable and connectivity measure, specifying weighted graph or binarized graph, 
subject- or group-level, and the global and/or node summaries used (e.g. clustering coefficient, efficiency, 
etc.). 
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Multivariate modeling and predictive analysis fe independent variables, features extraction and dimension reduction, model, training and aa 
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Work 


Send your careers story 
to: naturecareerseditor 
@nature.com 


KINDNESS UNDER 
CORONAVIRUS 


COVID-19 lockdowns could lead toa more 
respectful research culture. By Gemma Derrick 


n14 March, Spanish Prime Minister 

Pedro Sanchez announced a state 

of alarm (Estado de alarma) anda 

nationwide COVID-19 lockdown. 

This closed schools and led to some 

of Europe’s most severe social-distancing 

rules, punishable by fines and enforced by a 

constant on-street presence of police andthe 

army, often using drones. Similar restrictions 

had already been seen in China and Italy, andin 

the following weeks other European countries 

— including the United Kingdom — as well as 

the United States and my native Australia all 
followed suit. 

Spain is where I found myself, a 

higher-education researcher, on lockdown 

day — weeks into a five-month research visit 


to the country’s national research council. | 
was away from home and my job at Lancaster 
University, UK, unable to go outside, and 
juggling the homeschooling of my five-year- 
old son in central Madrid for an unknown 
amount of time. 

But alongside fear, limmediately witnessed 
an outpouring of kindness from colleagues 
and from the scientific community. My work 
focuses on building a kinder, gentler, more 
inclusive research culture by modifying one of 
its harshest processes, peer review. I know all 
too well how science can be an ultra-compet- 
itive, Machiavellian environment that can be 
unforgiving to individuals who are in need of 
guidance or generosity. However, in the light 
of COVID-19, both individual researchers and 
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research organizations are showing that there 
is a way of doing what we do that is not driven 
by ambition and ego. 

Might a disease that forces us apart through 
social distancing make us closer in the long 
run? Hereisa list of things | have experienced 
that make me think that kindness in research 
is possible: 

* The online platforms I use for work (Micro- 
soft Teams, Zoom, Skype and, more than ever, 
Twitter) are full of messages of support and 
unity from colleagues across the world, and 
with cheerful puppy pictures and virtual 
coffee breaks that are better attended than 
the physical variety. 

-A once narrow, competitive drive to collect 
data for our individual research ambitions has 
been replaced by dialogue about whether now, 
with everyone’s minds otherwise occupied, is 
really the best time to be collecting data? This 
goes beyond concerns about data reliability 
and reproducibility, towards a type of empathy 
and foresight that is the bedrock of research 
kindness. 

¢ Deadlines for grant proposals have been 
relaxed and postponed. And normally overly 
bureaucratic procedures, such as lodging 
requests for no-cost extensions on grants to 
accommodate interrupted research sched- 
ules, have been met with promises that such 
extensions will be given, no questions asked. 

¢ Ane-mail from my university’s ethics com- 
mittee demanding changes to my application 
within a strict time frame was followed up, 
after my out-of-office message, with a “don’t 
worry about your ethics application, stay safe”, 
allowing a blessed moment of relief. 

* As flexible-working requests have been 
granted and home working enforced, expec- 
tations about outputs have been replaced with 
empathy and understanding, an acknowledge- 
ment of caring responsibilities or sickness, and 
anxiety about our careers, families, friends 
and the world. 

* On 24 March, the 2021 UK Research Excel- 
lence Framework, the system for assessing 
research quality and a great source of stress 
to university academics, was postponed until 
further notice. This is explicitly to accommo- 
date the disruption universities face as the 
result of COVID-19. The announcement was 
met with widespread relief across the sector. 

* Commercial entities in academia, such as 
journals and publishing houses, are forgoing 
paywalls and making their entire content on 
COVID-19 open access, a move that until now 
was subject toimplementing a highly complex 
arrangement such as the open-access move- 
ment Plan S. In many cases, this lowering of 
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the paywall applies beyond papers related to 
COVID-19, providing essential reading mat- 
erial to relieve the inevitable boredom and the 
anxiety of being locked in our houses, proving 
that if there is a will, there is a way. 

- Finally, automated messages fromjournals 
chasing peer reviews have been eliminated 
and extensions offered. This crisis has high- 
lighted how automated computerized actions 
are inhuman during a time when researchers 
are constrained by new methods of working 
and teaching, illustrating the need for general 
humanity in our research behaviour. 

In my view, kindness in research culture is 
possible beyond COVID-19 — and efforts to 
make the research world a nicer place to work 
were happening long before the coronavirus 
crisis. In January 2018, Nature reported ona 
‘Kindness in Science’ workshop held at the 
University of Auckland, New Zealand (see 
Nature 553, 367-369; 2018). And in September 
last year, the UK charitable research funder 
Wellcome launched a “reimagine research” 
campaign. A summit on the topic, due to be 
held in London on 18 March, was postponed 
until later in 2020 because of COVID-19. 

For change to happen, it’s important to 
recognize why previous calls for a culture 
shift were heeded but rarely put into prac- 
tice. Why was this? A reluctance to change, 


“The pandemic has 
shown that with the 
right impetus, change 
canhappen.” 


lack of reward and recognition, personal 
ambition? Perhaps it goes into the ‘too hard’ 
basket because of institutionalized practices 
(human, managerial and technological) that 
are, by nature, resistant to change. 

The increased competitiveness and 
pressure towards research and university 
excellence (highlighted during the February 
2020 strikes by UK academics) drive aculture 
in which the mental health of researchers is 
compromised and discriminatory behav- 
iours are overlooked, and then embedded in 
research practice. 

The COVID-19 pandemic has shown that 
with the right impetus, change can happen. 
There are still many elements of research cul- 
ture that are in need of change, suchas gender 
and racial disparities, bias and the need for 
more opportunities to foster the careers of 
young researchers, to name a few. Inthe mean- 
time, we could use the momentum of COVID- 
19 to firmly embed kindness into research 
practice, extending greater goodwill beyond 
this temporary situation. 


Gemma Derrick is a senior lecturer in higher 
education at Lancaster University, UK. 


108 | Nature | Vol 581 | 7 May 2020 


SEEKING SOLACE 
IN SCIENCE FUN 


Videos, podcasts and light reading can 
help to lift the mood. By Atma Ivancevic 


ind your passion’ is, | think, one 
of the most overused and unhelp- 
ful clichés spouted on university 
campuses. The idea that every- 
one has a fixed set of interests 
is harmful, because it encourages us to drop 
subjects that don’t instantly bedazzle us. It cre- 
ates the illusion that the perfect project will 
always be enjoyable. In reality, every scientific 
venture comes with its own set of challenges. 
And choosing a PhD or postdoctoral project is 
influenced as much by factors suchas location 
and availability as by topic. 

Although PhD work can spark passion, it has 
a dark side. Nature’s November 2019 survey 
of more than 6,000 graduate students found 
that although most respondents were satis- 
fied with their decision to pursue a PhD, more 
than one-third had sought help for anxiety or 
depression caused by their PhD studies (see 
Nature 575, 406-406; 2019). The fact that 
students keep enrolling in PhD programmes, 
despite the documented experiences of past 
students, perhaps points towards a love of sci- 
ence and discovery that doesn’t always make 
its presence felt in the day-to-day work ofa 
PhD student. 

Scientists everywhere have faced unprec- 
edented challenges in the past few weeks as 
laboratories shut down and employees shifted 
to working from home. For some of us, these 
changes occurred within the space of a week, 
meaning that all experiments and cell lines 
had to be thrown out or frozen downat onlya 
few days’ notice. Having worked from home 
for two weeks now, I've realized that life as a 
remote scientist is full of Zoom lab meetings, 
Zoom hacking sessions, Zoom catch-ups and 
occasional Zoom happy hours (complete with 
‘quarantinis’). As acomputational biologist 
with no children or teaching obligations, the 
transition to working from home should the- 
oretically have been easy. As a human with 
increasing worries and a limited attention 
span, the current situation is, at best, dis- 
tracting. 

Personally, I’ve stopped counting how 
many ‘productive’ hours I have in a day. 
Instead, I’ve found it helpful to stay up to 
date with aspects of science that interest me, 
even if my current topic or task is not going 
smoothly. I find the reminder that I’m still 
‘into’ science, which] often get from outside 


aa 
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my day-to-day work, a reassurance during 
these uncertain times. Here are some of the 
resources I use whenever I need a science 
pick-me-up. 


YouTube 


Learning scientific concepts through video 
can be entertaining, informative and some- 
times hilarious. Here are some of my favourite 
channels. 

Simone Giertz (go.nature.com/2kr4vbn). 
Formerly known as the Queen of Shitty Robots, 
Giertz has made videos that showcase the 
delight of making useless things, as well as 
not-so-useless things such as a Tesla turned 
pick-up truck — Truckla. 

Acapella science (go.nature.com/2xve6bf). 
Combining deep science with sweet harmony, 
Tim Blais creates musical summaries of ahuge 
range of science topics including, but not lim- 
ited to, evolutionary developmental biology, 
quarks and exoplanets. 

True Facts by zefrankl (go.nature. 
com/3awzxzm). As an Australian in the United 
States studying mammals, | often get asked 
about kangaroos and koalas. I always recom- 
mend his marsupials explanation (go.nature. 
com/2vuyuun). 


Podcasts 


Currently, my go-to science podcasts are Buffs 
Talk Science and Science Vs. 


Layperson-friendly reading 


| often read summaries (across all fields, not 
just genetics) by Ed Yong (The Atlantic) and 
Carl Zimmer (The New York Times) before pur- 
suing the underlying publications. 


Science is meant to be fun. Taking a break to 
program with pictures or revel in an interest- 
ing discovery about barnacle sex can make 
the self-isolation of working from home more 
bearable. Whether that means experiencing 
Eminem-inspired physics a capella or discov- 
ering anew podcast, we could all use some 
light-hearted science right now. 


Atma lIvancevic is a bioinformatics postdoc 
at the University of Colorado Boulder. She 
studies ancient retroviruses: genetic parasites 
that invaded our ancestors millions of years 
ago and still have a role in modern diseases. 
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A student at the University of Warsaw assembles 3D-printed protective masks. 


OPEN SCIENCE 
TAKES ON COVID-19 


Data sharing and hobbyists are being harnessed 
to combat the pandemic. By Mark Zastrow 


hen reports emerged in late 2019 

of an outbreak of anew corona- 

virus centred in Wuhan, China, 

researchers at the virologi- 

cal-analysis website Nextstrain 
were ready. The open-source project tracks the 
spread of viruses through genetic variations 
inthe sequences that scientists find. After five 
years of development and operation, Next- 
strain had team members on three continents 
who could continuously refresh the analysis, 
24 hours a day. 

What they didn’t know was whether 
researchers would share their data. “You 
just never know what level of detail is going 
to be allowed to come out,” says Emma Hod- 
croft, a Nextstrain developer and molecular 
epidemiologist at the University of Basel in 
Switzerland. 

But since 11 January, when a team led by 
Zhang Yong-Zhen at the Shanghai Public 
Health Clinical Center, China, shared the first 
genome sequence of the SARS-CoV-2 virus, the 
volume of data has skyrocketed. By the end 
of March, Nextstrain was receiving anywhere 


from 50 to 200 sequences a day from labora- 
tories around the world, and was running its 
analysis of virus evolution every few hours. 
“The volume that we're getting right now, this 
is totally unprecedented,” says Hodcroft. 

Nextstrain is just one example of how an 
open ethos has driven the scientific response 
tothe COVID-19 pandemic. Academics, online 
data repositories and home hobbyists with3D 
printers are adopting new practices of rapid 
data sharing and collaboration that are appro- 
priate tothe urgency of the crisis. Many hope it 
will change the way science is done even after 
the pandemic subsides. 


Do it yourself 
Perhaps nowhere is that open ethos clearer 
than inthe way do-it-yourself (DIY) and ‘maker’ 
communities have stepped up. As soonas it 
became clear that health systems around the 
world were at risk of running out of crucial 
equipment to treat people with COVID-19 and 
protect medical workers, DIY-ers set about try- 
ing to close the gap. 

Facebook groups such as Open Source 
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COVID19 Medical Supplies, which has more 
than 70,000 members, have become dispatch 
centres, through which hospital workers seek 
volunteers to print or make supplies, and vol- 
unteers trade tips on what materials to use 
and where to source them, and onsterilization 
procedures. 

The coronavirus crisis plays to 3D printing’s 
strong points — rapid prototyping and the abil- 
ity to produce parts on demand anywhere in 
the world. Prusa Research, a manufacturer of 
3D printers in Prague, has designed a frame for 
a face shield that is meant to be worn outside 
a mask or respirator to protect against infec- 
tious droplets. The company says it has the 
capacity to produce 800 shields per day, and 
tens of thousands of the devices are already 
protecting health-care workers in the Czech 
Republic. But because the company made its 
designs open-source, they are also being made 
around the world in maker spaces and homes. 

Formlabs, a3D-printer manufacturer based 
in Somerville, Massachusetts, leads another 
project that has reached production: print- 
ing nasal swabs for COVID-19 test kits. Unlike 
common cotton swabs, nasal swabs must 
have a rod that is long and flexible enough 
to reach deep into the nose, to the upper 
throat. The swabs were designed by doctors 
at the University of South Florida in Tampa 
and the Northwell Health hospital system in 
New York, using printers purchased from the 
company to produce test versions. “They are 
prototyping it themselves, whichis crazy and 
really awesome,’ says Formlabs’s chief product 
officer, David Lakatos. And whereas conven- 
tional swabs feature a bushy tip coating of 
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nylon flock, the doctors devised a tip with an 
intricately textured pattern that is 3D-printed. 
But unlike face shields, these parts are 
beyond the capabilities of most printers used 
by hobbyists. “If someone tried to print the 
swabs ona hobbyist printer, they can really do 
harm’ ina clinical setting, says Lakatos. 
Under US regulations, commercial manu- 
facturing of nasal swabs must take place ina 
facility that has been registered with the US 
Food and Drug Administration (FDA). Form- 
labs has its own registered lab in Millbury, 
Ohio, with 250 printers (each costing about 
US$3,500) that can print 100,000 swabs a day. 


The right tool 


Other 3D-printed and DIY projects seek to pro- 
vide everything from protective face masks 
for medical workers to door handles that can 
be opened using an elbow — helping health- 
care staff to avoid contaminating their hands 
—and ventilators for people who are critically 
ill. Among the furthest along in development 
are the OpenLung ventilator —a collaboration 
between groups based in Toronto, Canada, and 
Dublin — and the MIT Emergency Ventilator 
developed at the Massachusetts Institute of 
Technology in Cambridge. But manufactur- 
ing of such devices is still subject to regula- 
tory approval. The MIT team told Nature that 
“approval would be sought by a manufacturer 
that ultimately adapts and makes a device 
inspired by the open-source reference mate- 
rial”. On17 April, the first such device, called 
Spiro Wave, received Emergency Use Author- 
ization from the FDA, making it available for 
use during the crisis; New York City has already 
ordered 3,000 units. 
In the United Kingdom, a collaboration 
between University College London (UCL), 
the UCL Hospital and the Mercedes Formula 
One racing team has reverse-engineered and 
optimized a ‘continuous positive airway pres- 
sure’ device. The design has been approved 
by UK authorities for use during the COVID- 
19 pandemic and made available at no cost to 
manufacturers and researchers. The National 
Health Service has ordered upto 10,000 units. 
And in Nigeria, Yunusa Mohammed Garba, a 
researcher at Gombe State University, has built 
a positive-pressure ventilator from hobbyist 
and second-hand components, for use in the 
northeastern Gombe state, a resource-con- 
strained part of the country. Nigeria has a pop- 
ulation of about 200 million, yet it might have 
fewer than 500 ventilators. Garba’s design 
is currently being optimized and tested for 
use at the Federal Teaching Hospital Gombe, 
which plans to obtain two devices. “At the 
moment [the ventilator] can only be used in 
the ambulance,” Garba says. “We are currently 
using funding from the government to build 
an upgraded version of the ventilator that can 
be fully utilized in the hospitals.” 

Still, even proponents of 3D printing find 
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some of the projects potentially dangerous. 
“It’s both inspiring and extremely scary,” says 
Lakatos. Formlabs, for instance, investigated 
face-mask designs and produced numerous 
prototypes before recommending against 3D 
printing them. “The [3D-printed] face masks 
that I’m seeing, those designs are absolutely 
not sealing anything,” says Lakatos. “And I 
think they may be even giving a false kind of 
confidence to people.” 

Following discussions with clinicians, Form- 
labs has instead been recommending a DIY res- 
pirator design produced by Boston Children’s 
Hospital in Massachusetts that repurposes off- 
the-shelf parts, including ventilator filters and 
aface mask used for administering anaesthetic 
gas. “It seems to be a much better solution than 
trying to doit with 3D printing,” says Lakatos. 

One of the most widespread open-source 
face-shield designs eschews 3D printing 
entirely. The project began in March with 
the University of Wisconsin Makerspace in 
Madison, which worked with Midwest Pro- 
totyping, a3D-printing company in nearby 


“It’s both inspiring 
and extremely scary.” 


Blue Mounds, to produce them. But after 
bringing in Jesse Darley at the Madison office 
of Delve, an engineering design firm, the 
group decided to change tack. Instead of 3D 
printing, the frames and straps of the resulting 
‘Badger Shields’ (named after the university’s 
mascot) are made from elastic and foam that 
can be purchased off-the-shelf in bulk form, 
and cut down either by machine or by hand. 
Darley says such components can be made 
in 20 seconds, compared with several hours 
through 3D printing. 

The Madison group has already received 
orders for five million shields. To meet that 
demand, manufacturers have stepped in to 
help, including Ford Motor Company, which 
Darley says has tweaked the design for mass 
production and can make around one million 
shields per week. 


Where credit’s due 

The open ethos is influencing other aspects 
of the pandemic response, too. More than 
2,000 articles on COVID-19 have been posted 
inthe preprint archives bioRxiv and medRxiv, 
according to biomed-sanity.com, a site that 
aggregates preprints related to the pandemic. 
Numerous COVID-19 data sets are available on 
the code-sharing site GitHub, including the 
data underlying Johns Hopkins University’s 
widely used COVID-19 case-tracking dash- 
board. So, too, are reviews of the COVID-19 
literature by researchers at the Icahn School 
of Medicine at Mount Sinai in New York City, 
and separately by a collaborative project led 
by computational biologists Halie Rando and 
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Casey Greene at the University of Pennsylvania 
in Philadelphia. 

Yet it wasn’t a given that researchers would 
embrace openness early in the outbreak: 
data that are made public can be difficult to 
publish through conventional channels later. 
And multiple news reports have suggested 
that health workers and researchers in China 
were initially subjected to government limits 
on what information they could release. But 
when Chinese researchers uploaded the first 
genome sequence of the SARS-CoV-2 virus to 
the online repositories virological.org and 
GenBank, they opened the floodgates for more 
sequences from China and from the rest of the 
world, Hodcroft says. “Iam very grateful for 
the scientists who took this risk, because I 
think this set the precedent for the rest of the 
epidemic.” Given that the outbreak was initially 
confined to China, had those researchers not 
done so, “we might have completely different 
pictures that might be incorrect” she says. 

Hodcroft hopes that these collabora- 
tive practices will carry over to research on 
other viruses and seasonal outbreaks. Not 
all labs have the equipment and personnel to 
sequence a viral genome, and even for those 
that do, the work requires time and money. 
But if more take that step — even if only every 
couple of weeks — she says it should be possi- 
ble to track outbreaks in greater detail, using 
mutations as markers to better understand 
their geographical spread. 

The pandemic could also bring lasting 
changes in how medical equipment is devel- 
oped, produced and distributed. Lakatos 
would like to see hospitals have their own 3D 
printers as an emergency back-up to produce 
crucial equipment such as nasal swabs. And 
publicly available designs for parts such as 
face shields could make it easier to overcome 
breakdowns in international supply chains, 
allowing for more flexible, distributed manu- 
facturing, says Darley. 

Soon after releasing the Badger Shield plans, 
Darley was contacted by acompany in Bonner, 
Montana, he had worked with that makes cycle 
rickshaws. The firm, called Coaster Cycles, had 
laid off or drastically cut hours for nearly all its 
workers because of the pandemic, but after 
seeing the open-source plans, it wonacontract 
to supply shields to health systems spanning 
six US states — and hired back its workforce 
to produce them. Chief executive Ben Morris 
says the company eventually hopes to sell one 
million face shields. “That’s the power of open 
source,” says Darley. “It allows a family to make 
afew [units], ora manufacturer to make thou- 
sands, or hundreds of thousands.” 


Mark Zastrow is a writer based in Seoul, 
South Korea. 


Additional reporting by Abdullahi Tsanni in 
Abuja, Nigeria. 
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interview people who lived in the 
past. I’m an anthropologist at Tel Aviv 
University in Israel, but I call myselfa 
biohistorian. I’m trying to understand 
human history, not from human 
artefacts, temples or big walls surrounding 
old cities, but from bones. They record every 
important aspect of your life: what you eat, 
your ailments, your broken hand at age five. 
My main interest is the origin of our own 
species, Homo sapiens: how we evolved, 
and how we interacted with other Homo 
groups, such as Homo neanderthalensis. In 
this photograph, taken in the preservation 
laboratory where we first clean our finds, 
I’m using a pneumatic drill to slowly expose 
the bones of ahuman who lived between 
100,000 and 120,000 years ago. anda 
colleague discovered them ina small cave 
called Tinshemet, located near the city of 
Shoham in central Israel. It is also known 
as Mugharet Al Watwat in Arabic, meaning 
‘cave of the bat’. The block, which my team 
removed from the cave, is a half-tonne piece 
of breccia, an accumulation of sedimentary 
fragments of rock or mineral debris. 
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This is probably one of the most 
important human fossils ever found — the 
earliest intentional burial that we know of. 
The skeleton is fully articulated and lies on 
its side in the fetal position, which suggests 
that the burial was organized. We don’t know 
yet if it’s amodern human, an archaic Homo 
or a Neanderthal, because it will take at least 
18 months to expose the bones. 

Right now my lab is closed; nevertheless, 
I continue to work from home at full speed. 
Anthropology requires the collaboration of 
scientists from different disciplines, and we 
do Zoom meetings almost every day. 

In this work, your fossil becomes your 
best friend. You want to know them, to get 
excited about every discovery and new 
morphological trait. It’s very relaxing to 
work slowly in a quiet environment, not 
disturbed by people or external noise. This 
is what I love the most. 


Israel Hershkovitz is director of the Dan David 
Center for Human Evolution and Biohistory 
Research at Tel Aviv University. 

Interview by Josie Glausiusz. 


